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BACKGROUND OF THE INVENTION 



The present application claims the priority of co-pending U.S. Provisional Patent 
Application Serial No. 60/038,118, filed February 20, 1997, incorporated herein by reference in 
its entirety without disclaimer. The government owns rights in the present invention pursuant to 
grant numbers R01-CA 67274 and R01-EY 06195 from the National Institutes of Health, and 
grant number ATP00494901 8 from the Texas Higher Education Coordinating Board. 

1. Field of the Invention 

The present invention relates generally to the field of molecular and cellular biology. 
More particularly, it concerns modifications of the retinoblastoma tumor suppressor. The present 
invention further relates to the use of the instant modified retinoblastoma tumor suppressors in 
situations where providing a tumor suppressor or normal cell growth suppressor is indicated. 

2. Description of Related Art 

Cancers and tumors are the second most prevalent cause of death in the United States, 
causing approximately 450,000 deaths per year. One in three Americans will develop cancer, 
and one in five will die of cancer (Scientific American Medicine, part 12, I, 1, section dated 
1987). While substantial progress has been made in identifying some of the likely environmental 
and hereditary causes of cancer, the statistics for the cancer death rate indicates a need for 
substantial improvement in the therapy for cancer and related diseases and disorders. 

A number of genes have been implicated in the etiology of cancer. These genes have 
been identified in connection with hereditary forms of cancer, and in a large number of well- 
studied tumor cells. Study of cancer genes has helped provide some understanding of the process 
of tumorigenesis. While a great deal more remains to be learned about cancer genes, the 
presently known cancer genes serve as useful models for understanding tumorigenesis. Cancer 
genes are broadly classified into "oncogenes" which, when activated, promote tumorigenesis, and 
"tumor suppressor genes" which, when damaged, fail to suppress tumorigenesis. While these 
classifications provide a useful method for conceptualizing tumorigenesis, it is also possible that 



a particular gene may play differing roles depending upon the particular allelic form of that gene, 
its regulatory elements, the genetic background and the tissue environment in which it is 
operating. 

5 The oncogenes are somatic cell genes that are mutated from their wild-type alleles (the art 

refers to these wild-type alleles as protooncogenes) into forms which are able to induce 
tumorigenesis under certain conditions. There is presently a substantial literature on known and 
putative oncogenes and the various alleles of these oncogenes. For example, the oncogenes ras 
and myc are considered as models for understanding oncogenic processes in general. The ras 
10 oncogene is believed to encode a cytoplasmic protein, and the myc oncogene is believed to 
encode a nuclear protein. Neither the ras oncogene nor the myc oncogene alone is able to induce 
H; full transformation of a normal cell into a tumor cell, but full tumorigenesis usually occurs when 
^ both the ras and myc oncogenes are present and expressed together in the same cell (Weinberg, 
FU 1989). Such collaborative effects have been observed between a number of other studied 
jfe5 oncogenes. 

" The collaborative model of oncogene tumorigenesis must be qualified by the observation 

jfy that a cell expressing the ras oncogene that is surrounded by normal cells does not undergo full 
[ft transformation. However, if most of the surrounding cells are also ras-expressing, then the ras 
ff) oncogene alone is sufficient to induce tumorigenesis in a ray-expressing cell. This observation 
validates the multiple hit theory of tumorigenesis because a change in the tissue environment of 
the cell hosting the oncogene may be considered a second hit. An alternative and equally valid 
hypothesis is that events that collaborate with the activation of an oncogene such as ras or myc 
may include the inactivation of a negative regulatory factor or factors, i.e., a tumor suppressor 
25 protein (Weinberg, 1989; Goodrich et a/., 1 992a). 

Tumor suppressor genes are genes that, in their wild-type alleles, express proteins that 
suppress abnormal cellular proliferation. When the gene coding for a tumor suppressor protein is 
mutated or deleted, the resulting mutant protein or the complete lack of a tumor suppressor 
30 protein may fail to correctly regulate cellular proliferation. This can lead to abnormal cellular 
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proliferation, particularly if there is already existing damage to the cellular regulatory 
mechanism. The lack of control of cellular proliferation has been linked to the development of a 
wide variety of human cancers (Weinberg, 1991). A number of well-studied human tumors and 
tumor cell lines have been shown to have missing or nonfunctional tumor suppressor genes. 

Examples of tumor suppressor genes and candidate tumor suppressor genes include, but 
are not limited to, the retinoblastoma (RB) gene (Friend et al, 1986; Fung et al, 1987; Lee et al, 
1987a), the wild-type p53 gene (Finlay etal, 1989; Baker etal., 1990), the deleted in colon 
carcinoma (DCC) gene (Fearon etal, 1990a; 1990b), the neurofibromatosis type 1 (NF-1) gene 
(Wallace etal, 1990; Viskochil etal, 1990; Cawthon etal, 1990), the Wilms tumor (WT-1) 
gene (Call et al, 1990; Gessler etal, 1990; Pritchard-Jones et al, 1990), the von Hippel-Lindau 
(VHL) disease tumor suppressor gene (Duan et al, 1995), the Maspin (Zou et al, 1994), Brush- 1 
(Schott et al, 1994) and BRCA 1 genes (Miki et al, 1994; Futreal et al, 1994) for breast cancer, 
and the multiple tumor suppressor (MTS) or pi 6 gene (Serrano etal, 1993; Kamb etal, 1994). 
The list of putative tumor suppressor genes is large and growing, with the total number of tumor 
suppressor genes expected to be well beyond 50 (Knudson, 1993). 

The first tumor suppressor gene identified was the retinoblastoma (RB) gene, which 
causes the hereditary retinoblastoma (Knudson, 1971; Murphree and Benedict, 1984; Knudson, 
1985). The retinoblastoma (RB) gene, which was cloned in the middle 1980s, is one of the best 
studied tumor suppressor genes. The size of the RB gene complementary DNA (cDNA), about 
4.7 kb, permits ready manipulation of the gene, and has led to the insertion of the RB gene into a 
number of cell lines. The RB gene has been shown to be missing or defective in a majority of 
retinoblastomas, sarcomas of the soft tissues and bones, and in approximately 20 to 40 percent of 
breast, lung, prostate and bladder carcinomas (Lee etal, WO 90/05180; Bookstein etal, 1991; 
Benedict etal, 1990). 

The most direct proof that the cloned RB gene is indeed a tumor suppressor gene is the 
observed recovery of tumor suppression function in RB-minus tumor cells from the introduction 
of a cloned intact copy of the RB gene. A number of reports have indicated that replacement of 



the normal RB gene in RB-defective tumor cells from disparate types of human cancers could 
suppress their tumorigenic activity in nude mice (Huang etah, 1988; Goodrich and Lee, 1993; 
Zhou etah, 1994b). The tumor cell lines studied were derived from widely disparate types of 
human cancers such as the retinoblastoma, osteosarcoma, carcinomas of the bladder, prostate, 
breast and lung. 

While it was observed that introduction of a functional wild-type, full-length 
retinoblastoma gene (RB 110 ) into an RB-minus tumor cell "normalizes" the cell, it was not 
expected that tumor cells which already have normal RB 110 gene expression ("RB + ") would 
respond to RB 110 gene therapy, because it was presumed that adding additional RB expression 
could not correct a non-RB genetic defect. This has in fact been shown for the case of the RB + 
osteosarcoma cell line U-2 OS, where the introduction of an extra pllO 116 coding gene did not 
change the neoplastic phenotype (Huang etah, 1988). Thus, there remains a need for a broad- 
spectrum tumor suppressor gene for treating abnormally proliferating cells having any type of 
genetic defect. 

The RB 11U cDNA open reading frame sequence (McGee et ah , 1 989) contains a second 
in-frame AUG codon located in exon 3, at nucleotides 355-357. The protein initiated from this 
second AUG codon lacks the N-terminal 1 12 amino acid residues of the full-length RB protein, 
and is termed pRB (Xu etah, 1994b). In U.S. Patent 5,496,731 (incorporated herein by 

94 

reference), the inventors showed that RB-defective tumor cells expressing exogenous pRB did 

3 

not progress through the cell cycle, as evidenced by their failure to incorporate [ H] -thymidine 

into DNA. In contrast, the percent of tumor cells undergoing DNA replication were only slightly 
i • ii i . 110 

lower m cells producing the exogenous pRB (the wild-type pRB protein) than in cells that 

__ 94 

were RB . Even more striking was that the pRB expression also significantly reduced colony 
formation of two RB + (with normal RB alleles) tumor cell lines examined, namely the 
fibrosarcoma cell line, HT1080, and the cervical carcinoma cell line, HeLa (Xu etah, 1994b), 
while no such effects were observed when an additional pRB H0 -coding gene(s) was introduced 



by transfection using plasmid vectors (Fung et al, 1993) or by microcell fusion (Anderson et al, 
1994). 

However, there is a paucity of tumor suppressor proteins in the art which have all of the 
properties necessary to facilitate their use in the treatment of diseases, particularly cancer. 

SUMMARY OF THE INVENTION 

The modified retinoblastoma tumor suppressors of the present invention overcome the 
shortcomings of those described in the art, providing a broad spectrum tumor suppressor with 
surprising beneficial effects. 

The present invention provides broad-spectrum modified retinoblastoma tumor 
suppressor proteins that are suprisingly at least as effective, and in most cases more effective, 
than the corresponding wild-type retinoblastoma tumor suppressor proteins in inhibiting cell 
growth. In particular embodiments, the invention provides retinoblastoma tumor suppressor 
proteins that have a modified N-terminal region. The invention further provides methods of 
making and using the modified retinoblastoma tumor suppressor proteins, particularly in 
circumstances wherein cell growth inhibition is desired. Thus the present invention provides 
methods for treating diseases, as exemplified by, but not limited to cancer, that are characterized 
by abnormal cellular proliferation. 

A broad-spectrum tumor suppressor gene is a genetic sequence coding for a protein that, 
when inserted into and expressed in an abnormally proliferating host cell, e.g., a tumor cell, 
suppresses abnormal proliferation of that cell irrespective of the cause of the abnormal 
proliferation. 

Thus, the invention provides an isolated DNA segment comprising an isolated gene 
encoding a modified retinoblastoma tumor suppressor protein other than pRB 94 or pRB 56 , the 
modified retinoblastoma tumor suppressor protein comprising an N-terminal modification. The 



terms "pRB " and M pRB " refer to retinoblastoma proteins that have a molecular weight of 94 
kDa and 56 kDa, respectively. As understood in the art, the pRB 94 and pRB 56 retinoblastoma 
proteins are fragments of the full length wild-type retinoblastoma protein that have 1 12 and 379 
contiguous amino acids deleted from the N-terminus, respectively. 

The term "N-terminar, or "N-terminal region", as used herein, will be understood to refer 
to the region of a protein corresponding to as much as the first approximately 40% of the amino 
acid sequence. Thus, these terms will be understood to include up to about the first 5%, the first 
10%, the first 15%, the first 20%, the first 25%, the first 30% or the first 35% of the amino acid 
sequence of a protein. However, these values are only approximations, and therefore will be 
understood to include intermediate values, such as 2%, 3%, 6%, 7%, 11%, 13%, 17%, 18%, 
22%, 26%, 33%, 37%, 38%, 41%, 42% and the like. 

The term "modified", as used herein, refers to deletions and/or mutations of the wild-type 
protein sequence. In certain embodiments, it may also refer to insertion of a heterologous amino 
acid or amino acids into the wild-type protein sequence. In yet other aspects, the term may refer 
to post-translational alteration of the wild-type amino acid sequence. 

In a further embodiment of the invention, the gene encodes a modified retinoblastoma 
tumor suppressor protein comprising an N-terminal region that comprises a first sequence region 
from which at least one amino acid has been deleted. The deletion may produce a modified 
retinoblastoma tumor suppressor protein with a biological activity equal to, or in certain 
embodiments, greater than the biological activity of the corresponding wild-type retinoblastoma 
tumor suppressor protein. 

In a particular embodiment of the invention the gene encodes a modified retinoblastoma 
tumor suppressor protein wherein at least two amino acids have been deleted from the first 
sequence region. In other embodiments of the invention at least about five amino acids, at least 
about ten amino acids, at least about 25 amino acids, at least about 50 amino acids, at least about 
75 amino acids or at least about 100 amino acids have been deleted from the first sequence 



region. It will be understood that intermediate deletion sizes are also contemplated, such as, but 
not limited to, 3, 4, 6, 7, 8, 9, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 26, 27, 28, 29, 
30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 51, 52, 53, 54, 55, 56, 
57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 76, 77, 78, 79, 80, 81, 82, 83, 
5 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98 or 99 amino acids and the like. 

In other aspects of the invention, the gene encodes a modified retinoblastoma tumor 
suppressor protein wherein at least about 150 amino acids, at least about 200 amino acids, at lest 
about 250 amino acids, at least about 300 amino acids or at least about 370 amino acids have 
10 been deleted from the first sequence region. However, intermediate sized deletions are also 
provided, exemplified by, but not limited to, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 
Q 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 
| 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 
Pj 149, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 
j| 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 
jlj 188, 189, 190, 191, 192, 193, 194, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 
^ 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 
226, 227, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 243, 244, 
J 245, 246, 247, 248, 249, 251, 252, 253, 254, 255, 256, 257, 258, 259, 260, 261, 262, 263, 264, 
g) 265, 266, 267, 268, 269, 270, 271, 272, 273, 274, 275, 276, 277, 278, 279, 280, 281, 282, 283, 
284, 285, 286, 287, 288, 289, 290, 291, 292, 293, 294, 295, 296, 297, 298, 299, 301, 302, 303, 
304, 305, 306, 307, 308, 309, 310, 311, 312, 313, 314, 315, 316, 317, 318, 319, 320, 321, 322, 
323, 324, 325, 326, 327, 328, 329, 330, 331, 332, 333, 334, 335, 336, 337, 338, 339, 340, 341, 
342, 343, 344, 345, 346, 347, 348, 349, 350, 351, 352, 353, 354, 355, 356, 357, 358, 359, 360, 
25 361, 362, 363, 364, 365, 366, 367, 368, 369, 371, 372, 373, 374, 375, 376, 377 or 378 amino acid 
deletions. Other intermediate values are disclosed throughout the specification. 

In one embodiment of the invention the gene encodes a modified retinoblastoma tumor 
suppressor protein comprising an N-terminal region that comprises at least a first sequence 
30 region located between about amino acid 1 and about amino acid 50 from which at least one 
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amino acid has been deleted. It will be understood that "between about amino acid 1 and about 
amino acid 50" includes amino acid 1 and amino acid 50, and it is thus so with other deletions 
described herein. Amino acid 1 is the N-terminal amino acid, and the numbers increase toward 
the C-terminus. 

5 

In further embodiments of the invention, the first sequence region is located between 
about amino acid 51 and about amino acid 100, between about amino acid 101 and about amino 
acid 150, between about amino acid 151 and about amino acid 200, between about amino acid 
201 and about amino acid 250 or between about amino acid 251 and about amino acid 300. 

10 

In other embodiments of the present invention, the gene encodes a modified 
_ retinoblastoma tumor suppressor protein wherein the first sequence region is located between 
€3 about amino acid 1 and about amino acid 100, between about amino acid 51 and about amino 
|:j acid 150, between about amino acid 101 and about amino acid 200, between about amino acid 
p 151 and about amino acid 250 or between about amino acid 201 and about amino acid 300. 

3 In a particular aspect of the invention the gene encodes a modified retinoblastoma tumor 

g suppressor protein wherein the first sequence region is located between about amino acid 1 and 
^ about amino acid 150. In additional aspects of the invention the first sequence region is located 
2f) between about amino acid 51 and about amino acid 200, between about amino acid 101 and 
about amino acid 250 or between about amino acid 1 5 1 and about amino acid 300. 

In further embodiments of the invention the gene encodes a modified retinoblastoma 
tumor suppressor protein wherein the first sequence region is located between about amino acid 1 
25 and about amino acid 200, between about amino acid 51 and about amino acid 250, between 
about amino acid 101 and about amino acid 300, between about amino acid 1 and about amino 
acid 250, between about amino acid 51 and about amino acid 300, between about amino acid 1 
and about amino acid 300 or between about amino acid 1 and about amino acid 370. 
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In yet another aspect of the invention the modified retinoblastoma tumor suppressor 
protein is a modified retinoblastoma protein wherein about amino acid 2 through about amino 
acid 34 have been deleted from the first sequence region. The location of these particular amino 
acids is in reference to the human wild-type retinoblastoma protein, but will be understood to 
5 correspond to analogous regions of homologous retinoblastoma proteins. In yet another aspect of 
the invention about amino acid 2 through about amino acid 55 have been deleted from the first 
sequence region. In still another aspect of the invention about amino acid 2 through about amino 
acid 78 have been deleted from the first sequence region. In a particular aspect of the invention 
about amino acid 2 through about amino acid 97 have been deleted from the first sequence 
10 region. In an additional aspect of the invention about amino acid 2 through about amino acid 
148 have been deleted from the first sequence region. 

! %i In another embodiment of the invention the modified retinoblastoma tumor suppressor 

II J protein is a modified retinoblastoma protein wherein about amino acid 3 1 through about amino 
|| acid 107 have been deleted from the first sequence region. In another embodiment of the 

invention about amino acid 77 through about amino acid 107 have been deleted from the first 
I sequence region. In a further embodiment of the invention about amino acid 1 1 1 through about 
fy amino acid 181 have been deleted from the first sequence region. In yet another embodiment of 
J, the invention about amino acid 111 through about amino acid 241 have been deleted from the 
|l first sequence region. In still another embodiment of the invention about amino acid 1 8 1 through 

about amino acid 241 have been deleted from the first sequence region. In a particular 

embodiment of the invention about amino acid 242 through about amino acid 300 have been 

deleted from the first sequence region. 

25 In one aspect of the invention the N-terminal region of the modified retinoblastoma tumor 

suppressor protein further comprises at least a second sequence region from which at least one 
amino acid has been deleted. In a particular aspect of the invention, about amino acid 2 through 
about amino acid 34, and about amino acid 76 through about amino acid 1 12 have been deleted. 
In a further aspect of the invention about amino acid 2 through about amino acid 55, and about 

30 amino acid 76 through about amino acid 112 have been deleted. 

-10- 



Another embodiment of the invention provides a DNA segment comprising an isolated 
gene encoding a modified retinoblastoma tumor suppressor protein other than pRB 94 , the 
modified retinoblastoma tumor suppressor protein comprising an N-terminal modification 
5 wherein the gene encodes a modified retinoblastoma tumor suppressor protein comprising at 
least a first N-terminal mutation, and wherein the modified retinoblastoma tumor suppressor 
protein has an increased biological activity in comparison to the biological activity of the 
corresponding wild type retinoblastoma tumor suppressor protein. In one embodiment of the 
invention the gene encodes a modified retinoblastoma protein comprising a mutation at position 

10 111. In another embodiment of the invention the modified retinoblastoma protein comprises 
glycine at position 1 1 1 in place of aspartic acid. 

§ In a further embodiment of the invention the modified retinoblastoma tumor suppressor 

fy protein comprises at least a second N-terminal mutation. In yet another embodiment of the 

Jl invention the gene encodes a modified retinoblastoma protein comprising a mutation at position 
1 1 1 and a mutation at position 112. In still another embodiment of the invention the modified 

s retinoblastoma protein comprises glycine at position 1 1 1 in place of aspartic acid, and aspartic 

11 j acid at position 1 12 in place of glutamic acid. In a particular embodiment of the invention the 
\I gene encodes a modified retinoblastoma tumor suppressor protein comprising an N-terminal 
21 region from which at least one amino acid has been deleted, and which contains at least one 

amino acid mutation. 

In one aspect of the invention the gene encodes a modified retinoblastoma tumor 
suppressor protein that comprises a contiguous amino acid sequence from at least about position 

25 370 to about position 928 of SEQ ID NO:2. In another aspect of the invention the gene encodes 
a modified retinoblastoma tumor suppressor protein that comprises a contiguous amino acid 
sequence from at least about position 3 to about position 928 of SEQ ID NO:2. When used in 
this context, "a contiguous amino acid sequence" will be understood to be a contiguous amino 
acid sequence of at least about 8, about 10, about 12, about 15, about 20, about 25, about 50 or 

30 about 100 amino acids and so on up to the full length amino acid sequence. 
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In a further aspect of the invention the gene encodes a modified retinoblastoma protein 
comprising a contiguous amino acid sequence of SEQ ID NO:29. In yet another aspect of the 
invention the gene comprises a contiguous nucleic acid sequence from between position 7 and 
5 position 2691 of SEQ ID NO:28. When used herein in this context, n a contiguous nucleic acid 
sequence" will be understood to be a contiguous nucleic acid sequence of at least about 8, about 
10, about 12, about 15, about 17, about 20, about 25, about 50 or about 100 nucleotides and so on 
up to the full length nucleotide sequence. 

10 In still another aspect of the invention the gene encodes a modified retinoblastoma 

protein comprising a contiguous amino acid sequence of SEQ ID NO:31. In a particular aspect 
_ of the invention the gene comprises a contiguous nucleic acid sequence from between position 7 
4J and position 2628 of SEQ ID NO:30. In an additional aspect of the invention the gene encodes a 
ify modified retinoblastoma protein comprising a contiguous amino acid sequence of SEQ ID 
Si NO:33. 

s In another embodiment of the invention the gene comprises a contiguous nucleic acid 

JTj sequence from between position 7 and position 2559 of SEQ ID NO:32. In a further 
^ embodiment of the invention the gene encodes a modified retinoblastoma protein comprising a 
2§ contiguous amino acid sequence of SEQ ID NO:35. In yet another embodiment of the invention 
the gene comprises a contiguous nucleic acid sequence from between position 7 and position 
2502 of SEQ ID NO:34. In still another embodiment of the invention the gene encodes a 
modified retinoblastoma protein comprising a contiguous amino acid sequence of SEQ ID 
NO:37. In a particular embodiment of the invention the gene comprises a contiguous nucleic 
25 acid sequence from between position 7 and position 2349 of SEQ ID NO:36. In an additional 
embodiment of the invention the gene encodes a modified retinoblastoma protein comprising a 
contiguous amino acid sequence of SEQ ID NO:39. 



30 



In one aspect of the invention the gene comprises a contiguous nucleic acid sequence 
from between position 7 and position 2559 of SEQ ID NO:38. In another aspect of the invention 
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the gene encodes a modified retinoblastoma protein comprising a contiguous amino acid 
sequence of SEQ ID NO:41 . In a further aspect of the invention the gene comprises a contiguous 
nucleic acid sequence from between position 7 and position 2697 of SEQ ID NO:40. In yet 
another aspect of the invention the gene encodes a modified retinoblastoma protein comprising a 
5 contiguous amino acid sequence of SEQ ID NO: 43. In still another aspect of the invention the 
gene comprises a contiguous nucleic acid sequence from between position 7 and position 2583 of 
SEQ ID NO:42. In a particular aspect of the invention the gene encodes a modified 
retinoblastoma protein comprising a contiguous amino acid sequence of SEQ ID NO:45. In an 
additional aspect of the invention the gene comprises a contiguous nucleic acid sequence from 
1 0 between position 7 and position 2397 of SEQ ID NO:44. 

^ In one embodiment of the invention the gene encodes a modified retinoblastoma protein 

* comprising a contiguous amino acid sequence of SEQ ID NO:47. In another embodiment of the 
: : I invention the gene comprises a contiguous nucleic acid sequence from between position 7 and 
|j position 2613 of SEQ ID NO:46. In a further embodiment of the invention the gene encodes a 
^ modified retinoblastoma protein comprising a contiguous amino acid sequence of SEQ ID 
s NO:49. In yet another embodiment of the invention the gene comprises a contiguous nucleic 
?y acid sequence from between position 7 and position 2619 of SEQ ID NO:48. In still another 
;\I embodiment of the invention the gene encodes a modified retinoblastoma protein comprising a 
2i contiguous amino acid sequence of SEQ ID NO:51. In a particular embodiment of the invention 

the gene comprises a contiguous nucleic acid sequence from between position 7 and position 

2790 of SEQ ID NO:50. 

The invention thus provides a gene encodes a modified retinoblastoma protein 
25 comprising a contiguous amino acid sequence of SEQ ID NO:29, SEQ ID NO:31 ? SEQ ID 
NO:33, SEQ ID NO:35 ? SEQ ID NO:37, SEQ ID NO:39 ? SEQ ID NO:41, SEQ ID NO:43, SEQ 
ID NO:45, SEQ ID NO:47, SEQ ID NO:49 or SEQ ID NO:51. In one aspect of the invention the 
gene comprises a contiguous nucleic acid sequence from between position 7 and position 2691 of 
SEQ ID NO:28, from between position 7 and position 2628 of SEQ ID NO:30, from between 
30 position 7 and position 2559 of SEQ ID NO:32, from between position 7 and position 2502 of 



SEQ ID NO:34 ? from between position 7 and position 2349 of SEQ ID NO:36, from between 
position 7 and position 2559 of SEQ ID NO:38, from between position 7 and position 2697 of 
SEQ ID NO:40 ? from between position 7 and position 2583 of SEQ ID NO:42, from between 
position 7 and position 2397 of SEQ ID NO:44, from between position 7 and position 2613 of 
SEQ ID NO:46, from between position 7 and position 2619 of SEQ ID NO:48 or from between 
position 7 and position 2790 of SEQ ID NO:50. 

Another embodiment of the invention provides a DNA segment comprising an isolated 
gene encoding a modified retinoblastoma tumor suppressor protein other than pRB 94 or pRB 56 , 
the modified retinoblastoma tumor suppressor protein comprising an N-terminal modification, 
where the DNA segment is operationally positioned under the control of a promoter. In one 
embodiment of the invention this DNA segment is operationally positioned under the control of a 
recombinant promoter. In another embodiment of the invention the DNA segment is further 
defined as a recombinant vector. In a particular aspect of the present invention, the recombinant 
vector is an adenoviral vector. In another aspect, the recombinant vector is a retroviral vector. 

In a further embodiment of the invention the DNA segment is further defined as a 
component of a tetracycline responsive expression system. In yet another embodiment of the 
invention the DNA segment is operatively positioned downstream of a promoter comprising a 
tetracycline operator nucleic acid sequence; the tetracycline responsive expression system further 
comprising a second sequence region comprising an isolated gene encoding a fusion protein 
comprising a transcriptional transactivation domain operatively attached to a tetracycline 
repressor protein, the second sequence region operatively positioned downstream of a minimal 
promoter. 

In yet another embodiment of the invention the tetracycline responsive expression system 
is comprised within an adenoviral vector. In still another embodiment of the invention the 
adenoviral vector is comprised within a recombinant adenovirus. 



-14- 



The invention also provides a DNA segment comprising an isolated gene encoding a 
modified retinoblastoma tumor suppressor protein other than pRB 94 , the modified retinoblastoma 
tumor suppressor protein comprising an N-terminal modification, which is comprised within a 
host cell. In one embodiment of the invention the host cell is a prokaryotic cell In another 
5 embodiment of the invention the host cell is a eukaryotic celL In a further embodiment of the 
invention the host cell is a human cell. In yet another embodiment of the invention the host cell 
is a tumor cell. In still another embodiment of the invention the host cell is comprised within an 
animal In a particular embodiment of the invention the animal is a human subject. 

10 Another embodiment of the invention provides a DNA segment comprising an isolated 

gene encoding a modified retinoblastoma tumor suppressor protein other than pRB 94 , the 

f ^ modified retinoblastoma tumor suppressor protein comprising an N-terminal modification, which 
is dispersed in a pharmaceutical^ acceptable excipient 

|f Yet another embodiment of the invention provides an isolated DNA segment comprising 

^ an isolated gene encoding a modified retinoblastoma tumor suppressor protein other than pRB 94 , 
s the modified retinoblastoma tumor suppressor protein comprising an N-terminal modification, 
ft j wherein the modified retinoblastoma tumor suppressor protein is characterized as: comprising an 
^ N-terminal region that comprises at least a first sequence region from which at least one amino 
2© acid has been deleted, and wherein the modified retinoblastoma tumor suppressor protein has a 
biological activity at least about equivalent to the biological activity of the corresponding 
wild-type retinoblastoma tumor suppressor protein; or comprising an N-terminal region that 
comprises a first sequence region comprising at least one mutation, and wherein the modified 
retinoblastoma tumor suppressor protein has an increased biological activity in comparison to the 
25 biological activity of the corresponding wild-type retinoblastoma tumor suppressor protein. 

In certain aspects of the invention, the DNA segments as described above are 
contemplated for use in expressing a modified retinoblastoma tumor suppressor protein, for 
example in a host cell In other aspects, the DNA segments are contemplated for use in 
30 inhibiting cellular proliferation, or in the preparation of a medicament for inhibiting cellular 
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proliferation or treating cancer, for example in a human patient. Thus, the use of the instant 
DNA segments in the preparation of a modified retinoblastoma tumor suppressor protein, in 
inhibiting cellular proliferation, and in the preparation of a medicament for inhibiting cellular 
proliferation or treating cancer is provided. In certain uses, the medicament is intended for 
5 administration to a human patient, or formulated for parenteral administration. 

The invention further provides a modified retinoblastoma tumor suppressor protein other 
than pRB 94 , the modified retinoblastoma tumor suppressor protein comprising an N-terminal 
modification. 

10 

The invention also provides a recombinant host cell comprising a DNA segment 
p , comprising an isolated gene encoding a modified retinoblastoma tumor suppressor protein other 
than pRB 94 , the modified retinoblastoma tumor suppressor protein comprising an N-terminal 
fU modification. In one aspect of the invention the host cell is a prokaryotic host cell. In another 
|i aspect of the invention the host cell is E. coli. In a further aspect of the invention the host cell is 
y ,} a eukaryotic host cell. In yet another aspect of the invention the host cell is a tumor cell In still 
s another aspect of the invention the DNA segment is introduced into the cell by means of a 
ry recombinant vector. 

2® The invention further provides a method of inhibiting cellular proliferation, comprising 

contacting a cell with an effective inhibitory amount of a first modified retinoblastoma tumor 
suppressor protein other than pRB 94 , the modified retinoblastoma tumor suppressor protein 
comprising an N-terminal modification. In one embodiment of the invention the first modified 
retinoblastoma tumor suppressor protein comprises a modified retinoblastoma protein from 

25 which amino acids 1 1 1 through 241 have been deleted. In another embodiment of the invention 
the first modified retinoblastoma tumor suppressor protein comprises a modified retinoblastoma 
protein that comprises a mutation at position 1 1 1 and position 112. In a further embodiment of 
the invention the first modified retinoblastoma tumor suppressor protein is prepared by 
expressing a DNA segment encoding the modified retinoblastoma tumor suppressor protein in a 

30 recombinant host cell and collecting the modified retinoblastoma tumor suppressor protein 
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expressed by the cell. In yet another embodiment of the invention the cell is contacted with the 
first modified retinoblastoma tumor suppressor protein by providing to the cell a DNA segment 
that expresses the first modified retinoblastoma tumor suppressor protein in the cell. In still 
another embodiment of the invention the cell is provided with a tetracycline responsive 
expression vector system that expresses the first modified retinoblastoma tumor suppressor 
protein in the cell. In a particular embodiment of the invention the vector system is an adenoviral 
vector system. 

Another aspect of the invention provides a method of inhibiting cellular proliferation, 
comprising contacting a tumor cell with an effective inhibitory amount of a first modified 
retinoblastoma tumor suppressor protein other than pRB 94 , the protein comprising an N-terminal 
modification. In one aspect of the invention the cell is located within an animal and the first 
modified retinoblastoma tumor suppressor protein, or a gene encoding the modified 
retinoblastoma tumor suppressor protein, is administered to the animal in a pharmaceutical^ 
acceptable vehicle. As used herein, the term "gene" is defined as an isolated DNA segment that 
includes the coding region of the protein, or a portion thereof. Thus the term "gene" includes 
genomic DNA, cDNA or RNA encoding the protein. 

In another aspect of the invention the animal is a human subject. In a further aspect of 
the invention the cell is further contacted with a second tumor suppressor protein. In yet another 
aspect of the invention the cell is contacted with a modified retinoblastoma protein and a 
wild-type retinoblastoma, p53 or other tumor suppressor protein. 

The invention further provides a method of inhibiting cellular proliferation, comprising 
contacting a cell with a retinoblastoma protein and a p53 protein in a combined amount effective 
to inhibit cellular proliferation in the cell. 

The invention also provides a method of treating cancer, comprising administering to an 
animal with cancer a pharmaceutically acceptable composition comprising a biologically 
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effective inhibitory amount of a first modified retinoblastoma tumor suppressor protein, other 
than pRB 94 , that comprises an N-terminal modification. 

The terms "cancer" or "tumor" are clinically descriptive terms which encompass a myriad 
of diseases characterized by cells that exhibit unchecked and abnormal cellular proliferation. 
The term "tumor", when applied to tissue, generally refers to any abnormal tissue growth, i.e., 
excessive and abnormal cellular proliferation. A tumor may be "benign" and unable to spread 
from its original focus, or "malignant" and capable of spreading beyond its anatomical site to 
other areas throughout the hostbody. The term "cancer" is an older term which is generally used 
to describe a malignant tumor or the disease state arising therefrom. Alternatively, the art refers 
to an abnormal growth as a neoplasm, and to a malignant abnormal growth as a malignant 
neoplasm. 

Irrespective of whether the growth is classified as malignant or benign, the causes of 
excessive or abnormal cellular proliferation of tumor or cancer cells are not completely clear. 
Nevertheless, there is persuasive evidence that abnormal cellular proliferation is the result of a 
failure of one or more of the mechanisms controlling cell growth and division. It is also now 
believed that the mechanisms controlling cell growth and division include the genetic and tissue- 
mediated regulation of cell growth, mitosis and differentiation. These mechanisms are thought to 
act at the cell nucleus, the cell cytoplasm, the cell membrane and the tissue-specific environment 
of each cell. The process of transformation of a cell from a normal state to a condition of 
excessive or abnormal cellular proliferation is called tumorigenesis. 

It has been observed that tumorigenesis is usually a multistep progression from a normal 
cellular state to, in some instances, a full malignancy. It is therefore believed that multiple "hits" 
upon the cell regulatory mechanisms are required for full malignancy to develop. Thus, in most 
instances, it is believed that there is no single cause of excessive proliferation, but that these 
disorders are the end result of a series of cumulative events. 
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While a malignant tumor or cancer capable of unchecked and rapid spread throughout the 
body is the most feared and usually the deadliest type of tumor, even so-called benign tumors or 
growths can cause significant morbidity and mortality by their inappropriate growth. A benign 
tumor can cause significant damage and disfigurement by inappropriate growth in cosmetically 
sensitive areas, or by exerting pressure on central or peripheral nervous tissue, blood vessels and 
other critical anatomical structures. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The following drawings form part of the present specification and are included to further 
demonstrate certain aspects of the present invention. The invention may be better understood by 
reference to one or more of these drawings in combination with the detailed description of 
specific embodiments presented herein. 

FIG. 1. Relative activities of the modified hCMV promoters. The 5637 bladder 
carcinoma cells (lanes 1-5) and Saos2 osteocarcinoma cells (lanes 6-10) were transfected with 
reporter plasmids in which CAT gene expression was driven by the various modified 
(mhCMVp3, lanes 2 and 7; mhCMVp2, lanes 3 and 8; mhCMVpl, lanes 4 and 9) or full-length 
hCMV promoters (lanes 5 and 10). The % CAT activity is shown on the vertical axis. The CAT 
activity of the cells transfected with the plasmid carrying the full-length hCMV promoter (lanes 5 
and 10) is defined as 100 percent. 

FIG. 2. Expression of tTA from the modified mCMVp-tTA cassette has no squelching 
effects on the 5637 cell growth. A method of staining cells with crystal violet followed by 
measuring OD 550 was used for quantification of relative cell numbers (OD 550 shown on vertical 
axis; Gillies et al, 1986). Shown is the growth parent cells with (A) and without (□) 
tetracycline, and the mCMVp-r7^ transfected cells with (♦) and without (O) tetracycline. Days 
after transfection are shown on the horizontal axis. 
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FIG. 3A, FIG. 3B and FIG. 3C. The effects of tetracycline-regulatable pRB expression 
on tumor cell growth (OD 550 ; vertical axis). FIG. 3A. Representative long-term clone from the 
^-reconstituted osteosarcoma cell line (Saos-2, clone 11). FIG. 3B. Representative long-term 
clone from the ^-reconstituted breast carcinoma cell line (MDA-MB-468, clone 19-4). 
FIG. 3C. Representative long-term clone from the ^-reconstituted bladder carcinoma cell line 
(5637, clone 34-6). The cells were grown in the presence of 0.5 (ig/ml of Tc (□) versus absence 
of Tc (O). Cell growth of the tumor cells stopped 1 to 2 days after pRB expression was turned 
on in Tc-free medium (days shown on horizontal axis). The growth cessation was irreversible at 
day 4 (arrows) after stimulation with fresh medium containing 15% serum (Saos-2), 10% serum 
plus 2 jag/ml phytohemagglutinin (PHA; MDA-468) or 10% serum plus 4 (ag/ml of 
concanavalin A (Con A; 5637). 

FIG. 4A, FIG. 4B and FIG. 4C. The effects of tetracycline-regulatable pRB expression 
on soft agar colony formation. FIG. 4A. Percent colony formation (vertical axis) for three 
independent Saos2 osteosarcoma cell line clones (RBI 10 C14, lane 2; RBI 10 Clll, lane 3; 
RBI 10 C113, lane 4) and the Saos2 parent strain (lane 1). FIG. 4B. Percent colony formation 
(vertical axis) for two independent MDA-MB-468 breast carcinoma cell line clones (RbllO 
C119-4, lane 2; RM10C120-1, lane 3) and the MDA-MB-468 parent strain (lane 1). FIG. 4C. 
Percent colony formation (vertical axis) for two independent 5637 bladder carcinoma cell line 
clones (RbllO C134-6, lane 2; RbllO C136-9, lane 3) and the 5637 parent strain (lane 1). Soft 
agar colony formation of tumor cells with tetracycline-regulatable pRB expression was 
completely abrogated by induction of pRB in tetracycline-free medium. Colony formation is 
shown in the presence (open bar) and the absence (hatched bar) of tetracycline. 

FIG. 5. Time course analysis of the pRB 94 and pRB 110 expression in representative, Tc- 
regulatable Saos-2 cell clones in Tc-free media and its effects on DNA synthesis, using a 
H-thymidine incorporation assay. Lack of DNA synthesis as determined by failure of the tumor 
cells to incorporate thymidine implies growth cessation. The non-synchronized parental Saos-2 
cell population (•) maintained steady DNA synthesis; Representative pRB 110 -reconstituted (■) 
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and pRB -reconstituted (♦) Saos-2 clones are illustrated. Percent 3 H-labeled cells is shown on 
the vertical axis, and the hours after removal of tetracycline is shown on the horizontal axis. 

DESCRIPTION OF ILLUSTRATIVE EMBODIMENTS 

A. Tumor Suppressor Proteins 
1. Retinoblastoma 

Based upon study of the isolated RB cDNA clone, the predicted RB gene product has 928 
amino acids and an expected molecular weight of 106 kDa (Lee etal, 1987a; 1987b). The 
natural factor corresponding to the predicted RB gene expression product has been identified as a 
nuclear phosphoprotein having an apparent relative molecular mass (M r ) of between 105 and 114 
kDa (Lee et al, 1987b; Xu et al, 1989b; Yokota et al, 1988; Whyte et al, 1988). The literature 
generally refers to the protein encoded by the RB gene as pi 10™. On SDS-PAGE normal 
human cells show an RB protein pattern consisting of a lower sharp band with an Mr of 1 10 kD 
and a broader, more variable region above this band with an M r ranging from 1 10 kD to 1 16 kD. 
The 1 10 kD band is the underphosphorylated RB protein, whereas the broader region represents 
the phosphorylated RB protein. The heterogeneity of the molecular mass results from a varying 
degree of phosphorylation (Xu et al, 1989b). 

After years of intense scrutiny, the biological functions of the RB gene are beginning to 
be understood (reviewed in Cooper and Whyte, 1989; Hamel etal, 1993; Horowitz, 1993; Riley 
etal, 1994; Wang etal, 1994; Weinberg, 1995). The RB protein shows cyclical changes in 
phosphorylation during the cell cycle. Most RB protein is unphosphorylated during Gl phase, 
but most (perhaps all) RB molecules are phosphorylated in S and G2 phases (Xu etal, 1989b; 
DeCaprio etal, 1989; Buchkovich etal 9 1989; Chen etal, 1989; Mihara etal, 1989). The 
established components of the pRB pathway include the E2F transcription factors, which are 
involved in transcriptional control of numerous cellular genes responsible for the advances of 
cells through the cell cycle (Nevins, 1992; La Thangue, 1994). The pRB also interacts with 
certain Gl phase cyclins (Koff etal, 1992; Resnitzky and Reed, 1995; Geng etal, 1996). 
Therefore, the RB gene apparently plays a key role in cell growth regulation being involved in 
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the major decisions during the Gl phase of the cell cycle which govern cell proliferation, 
quiescence and differentiation (Weinberg, 1995). Furthermore, only the underphosphorylated 
RB protein binds to SV40 large T antigen. Given that RB protein binding by large T antigen is 
probably important for the growth promoting effects of large T antigen, this suggests that the 
underphosphorylated RB protein is the active form of the RB protein, and the phosphorylated RB 
protein in S and G2 phases is inactive (Ludlow et aL, 1989). 

It was reported that there was a striking difference in the ratio of underphosphorylated to 
phosphorylated pRB forms between normal fibroblasts growing exponentially and those arrested 
in Gl phase. More underphosphorylated pRB was observed in Gl arrested cells, suggesting the 
change in ratio of phosphorylated to underphosphorylated RB proteins was related to the 
fluctuation of cell cycle (Xu etal, 1989b). Four subsequent papers have described the cell 
cycle-dependent phosphorylation of RB protein in detail (DeCaprio etal, 1989; Buchkovich 
et al ; 1 989; Chen et al , 1 989; Mihara et al , 1 989). It is now widely accepted that the product of 
the RB gene has a key role in the cell cycle control. 

Cell proliferation depends on transcriptional activation of genes that are responsible for 
the onset of DNA synthesis as well as other critical events in the Gl phase of the cell cycle. As 
demonstrated by Pardee, transition of cells from a serum mitogens-dependent to serum mitogens- 
independent state is separated by a distinct time point at several hours before the onset of S 
phase, namely the R (restriction) point (Pardee, 1989). By passing through the R point, the cell 
commits itself to complete the remainder of the cell cycle through M phase. Therefore, the R 
point between the middle Gl and late Gl phases of the cell cycle represents a transition in the 
life of the cell that is as important as the Gl/S boundary. 

The phosphorylation status of pRB undergoes a readily distinguishable alteration at a 
time close to and perhaps contemporaneous with the R point transition of the cell cycle 
(Weinberg, 1995). During middle Gl phase, the only pRB species detected is an 
underphosphorylated form. When cells progress through the cell cycle, the pRB content 
increases gradually. However, the majority of pRB synthesized after middle Gl phase is 
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hyperphosphorylated. In other words, pRB hyperphosphorylation occurs in late Gl, preceding 
the Gl/S boundary (Xu etaL, 1991a; Mittnacht etaL, 1994). pRB maintains this 
hyperphosphorylated status throughout the remainder of the cell cycle, becoming 
dephosphorylated only upon evolution from M/early Gl (Ludlow etaL, 1990; Xu etaL, 1991a; 
Mittnacht et al. , 1994). 

The underphosphorylated form of pRB is able to form complexes with the transcription 
factor E2Fs or directly interact with the E2F site, and switches the E2F site from a positive to 
negative element in transcriptional control. The E2F site is present in the promoters of diverse 
cellular genes that are responsible for the advances of cells through the cell cycle, including 
c-myc, B-myb, cdc2, dihydrofolate reductase, thymidine kinase, and the RB as well as the E2F-1 
gene itself (Chellappan etaL, 1991; Nevins, 1992; Weintraub etaL, 1992; La Thangue, 1994; 
Shan etaL, 1994; Sardet etaL, 1995; Shan etaL, 1996). Since hyperphosphorylated pRB 
appears to have lost the ability to interact with E2Fs, the inhibitory function of pRB on cell 
growth can be abrogated by hyperphosphorylation. 

The timing of pRB phosphorylation led to an attractive functional model (Weinberg, 
1995). This model suggests that pRB is an R point guardian. pRB exerts most of its growth 
inhibitory effects in the first two thirds of the Gl phase. A cell that has progressed through early 
and middle Gl encounters the R point gate. Should conditions be ready for advance into the 
remainder of the cell cycle, pRB will undergo phosphorylation and functional inactivation, 
causing it to open the gate and to permit the cell to proceed into late Gl . Cells that lack normal 
pRB function for various reasons will proceed freely into late Gl. Without pRB, the upstream 
components of the cell cycle clock that regulate pRB phosphorylation, such as cyclin D, cyclin E 
and their corresponding cyclin-dependent kinases (CDKs) (Kato etaL, 1993; Ewen etaL, 1993) 
lose much of their influences in the decision of the cell to pass through the R point gate. Taken 
together, pRB allows the cell cycle clock to control the expression of numerous genes that 
mediate advance of the cell through a critical phase of its growth cycle being involved in the 
major decisions concurrent with the R point transition. Functional loss of pRB deprives the cell 
of this clock and thus of an important mechanism for braking cell proliferation. 
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Various mutations of the RB gene are known, and these are generally inactive. Mutations 
in RB are seen in virtually all cases of retinoblastoma; additionally, the RB gene products could 
potentially be inactivated by hyperphosphorylation, and by viral oncoprotein-like cellular protein 
binding. Although the RB gene was initially named because deletions or mutations within the 
gene caused the rare childhood ocular tumor, retinoblastoma, loss of pRB function is not only 
causally related to the retinoblastoma, but is also linked to the progression of many common 
human cancers. Additionally, there is growing evidence suggesting that the RB protein status is 
potentially a prognostic marker in urothelial carcinoma, non-small cell lung carcinoma, and 
perhaps also in some other types of human neoplasms (Xu, 1995). 

In addition, with the revolutionary antigen retrieval technique and the available specific 
anti-pRB antibodies, immunohistochemistry has recently become one of the highly sensitive and 
reliable methods for detection of pRB inactivation in routinely processed pathological specimens 
(Xu, 1995). Altered pRB expression as determined by immunohistochemical analysis appears to 
signal a poor prognosis in a subset of human malignancies. It was initially reported that loss of 
functional pRB was a statistically significant negative prognostic factor in high-grade adult soft 
tissue sarcomas (Cance etaL, 1990). Subsequently, two independent studies done concurrently 
concluded that altered pRB expression was a prognostic factor among patients with transitional 
cell carcinoma of the bladder (Cordon-Cardo et al. 9 1992; Logothetis et aL, 1992). 

For lung cancer patients, the initial pilot studies have also been promising, implying that 
altered RB and p53 protein status could be a synergistic prognostic factor in early stage non- 
small cell lung carcinomas (Xu et aL, 1994a). A much worse survival pattern has been reported 
as well for patients with acute myelogenous leukemia who have low or absent levels of pRB 
protein in their peripheral blood leukemic cells (Kornblau et aL, 1994). Since all studies done so 
far to investigate association between the pRB status in human cancer and the clinical outcome 
of the patients have been retrospective, and the number of cases in each cohort was fairly small, 
definitive retrospective and prospective studies with an adequate sample size for statistical 
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calculations are now underway to determine whether or not loss of pRB function can be 
considered as a prognostic factor in clinical practice. 

The most direct proof that the cloned RB gene is indeed a tumor suppressor gene comes 
from introduction of a cloned intact copy of the gene into cancer cells with observed tumor 
suppression function. A number of reports have indicated that replacement of the normal RB 
gene in RB-defective tumor cells from disparate types of human cancers could suppress their 
tumorigenic activity in nude mice (Huang etal, 1988; Goodrich and Lee, 1993; Zhou etal, 
1994b). The tumor cell lines studied were derived from widely disparate types of human cancers 
such as the retinoblastoma, osteosarcoma, carcinomas of the bladder, prostate, breast and lung 
(Table 2). 

Of note, there has been a tendency in the literature to separate the inhibition of cell 
growth by RB replacement in RB-defective tumor cells from its tumor suppression function 
(Takahashi etal, 1991; Chen etal, 1992; Goodrich etal, 1992b; Zhou etal, 1994b). After 
transient transduction with a wild-type pRB -expressing retrovirus or plasmid, as documented in 
several early studies, the RB-deficient retinoblastoma and osteosarcoma tumor cells in culture 
displayed striking changes, including cell enlargement, senescent phenotype and lower growth 
rate (Huang etal, 1988; Templeton etal, 1991). Subsequently, it was found that long-term 
stable clones of the RB-reconstituted tumor cells can be isolated that grew just as rapidly as the 
parental or matched RB" revertant clones. The majority of RB + clones obtained, however, were 
non-tumorigenic or with significantly reduced tumorigenicity in nude mice. The mechanisms for 
the dissociation of suppression of tumorigenicity in nude mice from inhibition of tumor cell 
growth in culture by RB-replacement are unclear. It is certainly possible that RB replacement 
restores sensitivity to a variety of physiologic growth inhibitory signals which may be present 
and supplied to cells when tumorigenicity assay is done in nude mice. Such external growth 
inhibitory agents would be absent under regular cell culture conditions, leading to rapid cell 
growth (Chen et al, 1992). 
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Although the molecular mechanism of the RB-mediated tumor suppression have 
remained unclear, suppression of tumorigenicity of RB~ tumor cells in vivo by re-expressing the 
wild-type pRB implies that the RB gene could be a potential therapeutic target for human cancer. 
In addition, recent reports suggest that RB may also play a role in elicitation of immunogenicity 
5 of tumor cells (Lu etal, 1994; Lu etal, 1996), anti-angiogenesis (Dawson etal, 1995) and 
suppression of tumor invasiveness (Li etal, 1996), which make the emerging RB gene therapy 
even more attractive. In this regard, preclinical studies have recently demonstrated that treatment 
of established human xenograft tumors in nude mice by recombinant adenovirus vectors 
expressing either wild-type or an N-terminal truncated retinoblastoma protein resulted in 

10 regression of the treated tumors (Xu et al, 1996). In addition, a constitutively active form of the 
pRB protein has been tested in a rat artery model of restenosis to inhibit vascular proliferative 

_ disorders following balloon angioplasty (Chang et al , 1 995). 

fy The RB gene expressing the first in-frame AUG codon-initiated RB protein is also 

|| referred to herein as the intact RB gene, the RB 110 gene or the pi \Q m coding gene. It has also 
been observed that lower molecular weight (<100 kD, 98 kD, or 98-104 kD) bands of unknown 
» origin which are immunoreactive to various anti-RB antibodies can be detected in 
}i; immunoprecipitation and Western blots (Xu etaL, 1989b; Furukawa etal, 1990; Stein etal, 
M 1990). 

m 

The RB 110 cDNA open reading frame sequence (McGee etal, 1989) contains a second 
in-frame AUG codon located in exon 3, at nucleotides 355-357. The deduced second AUG 
codon-initiated RB protein would be 98 kD, or 12 kD smaller than the pllO* 8 protein. It has 
been proposed that the lower molecular weight bands are the underphosphorylated (98 kD) and 
25 phosphorylated (98-104 kD) RB protein translated from the second AUG codon of the RB 
mRNA (Xu etal, 1989b), and this was later shown conclusively (Xu etal, U.S. Patent 
5,496,73 1). This protein is referred to as the p94 RB protein. 

It has been proposed that introduction of a functional RB 110 gene into an RB-minus tumor 
30 cell will likely "normalize" the cell. Of course, it was not expected that tumor cells which 
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already have normal RB no gene expression ("RB + ") would respond to RB 110 gene therapy, 
because it was presumed that adding additional RB expression could not correct a non-RB 
genetic defect. In fact, it has been shown that in the case of RB + tumor cell lines, such as the 
osteosarcoma cell line U-2 OS, which expresses the normal pi 10™, introduction of an extra 

RB 

5 pi 10 coding gene did not change the neoplastic phenotype of such tumor lines (Huang et al 9 
1988). 

In the only reported exception, introduction of a pi lO 1 ^ coding vector into normal human 
fibroblasts, WS1, which have no known RB or any other genetic defects, led to the cessation of 
10 cell growth (Fung et al 9 WO 91/15580, 1991). However, it is believed that these findings were 
misinterpreted since a plasmid, ppVUO-Neo, producing SV40 T antigen with a well-known 
^ growth-promoting effect on host cells was used improperly to provide a comparison with the 
] S effect of RB 110 expression on cell growth of transacted WS1 fibroblasts (Fung et al 
fij WO 91/15580, 1991). This view is confirmed by the extensive literature, clearly characterizing 
|| RB + tumor cells as "incurable" by treatment with wild-type RB 110 gene. In addition, it is 
y ; noteworthy that the WS1 cell line per se is a generally recognized non-tumorigenic human 
s diploid fibroblast cell line with limited cell division potential in culture. Therefore, 
fll WO91/15580 simply does not provide any method for effectively treating RB + tumors with an 
^ RB 110 gene. Thus, there remains a need for a broad-spectrum tumor suppressor gene for treating 
2L1 abnormally proliferating cells having any type of genetic defect. 

2. p53 

Somatic cell mutations of the p53 gene are said to be the most frequently mutated gene in 
human cancer (Weinberg, 1991). The normal or wild-type p53 gene is a negative regulator of 

25 cell growth, which, when damaged, favors cell transformation (Weinberg, 1991). As noted for 
the RB protein, the p53 expression product is found in the nucleus, where it may act in parallel 
with or cooperatively with pi 10™. This is suggested by a number of observations, for example, 
both p53 and pi 10™ proteins are targeted for binding or destruction by the oncoproteins of 
SV40, adenovirus and human papillomavirus. Tumor cell lines deleted for p53 have been 

30 successfully treated with wild-type p53 vector to reduce tumorigenicity (Baker etal, 1990). 
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However, the introduction of either p53 or RB 110 into cells that have not undergone lesions at 
these loci does not affect cell proliferation (Marshall, 1991; Baker etal, 1990; Huang etal, 
1988). Such experiments suggest that sensitivity of cells to the suppression of their growth by a 
tumor suppressor gene is dependent on the genetic alterations that have taken place in the cells. 
Such a dependency would be further complicated by the observation in certain cancers that 
alterations in the p53 tumor suppressor or gene locus appear after mutational activation of the ras 
oncogene (Marshall, 1991; Fearon et aL 9 1990a). Therefore, there remains a need for a broad- 
spectrum tumor suppressor gene that does not depend on the specific identification of each 
mutated gene causing abnormal cellular proliferation. 

3, Neurofibromatosis Type 1 

Neurofibromatosis type 1 or von Recklinghausen neurofibromatosis results from the 
inheritance of a predisposing mutant allele or from alleles created through new germline 
mutations (Marshall, 1991). The neurofibromatosis type 1 gene, referred to as the NF1 gene, is a 
relatively large locus exhibiting a mutation rate of around 10" 4 . Defects in the NF1 gene result in 
a spectrum of clinical syndromes ranging from cafe-au-lait spots to neurofibromas of the skin 
and peripheral nerves to Schwannomas and neurofibrosarcomas. The NF1 gene encodes a 
protein of about 2485 amino acids that shares structural similarity with three proteins that 
interact with the products of the ras protooncogene (Weinberg, 1991). For example, the NF1 
amino acid sequence shows sequence homology to the catalytic domain of ras GAP, a GTPase- 
activating protein for p21 ras (Marshall, 1991). 

The role of NF1 in cell cycle regulation is apparently a complex one that is not yet fully 
elucidated. For example, it has been hypothesized that it is a suppressor of oncogenically 
activated p21 ras in yeast (Marshall, 1991 citing Ballester et al 9 1990). On the other hand, other 
possible pathways for NF1 interaction are suggested by the available data (Marshall, 1991; 
Weinberg, 1991). At present, no attempts to treat NF1 cells with a wild-type NF1 gene have 
been undertaken due to the size and complexity of the NF1 locus. Therefore, it would be highly 
desirable to have a broad-spectrum tumor suppressor gene able to treat NF1 and any other type of 
cancer or tumor. 
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4. DCC 

The multiple steps in the tumorigenesis of colon cancer are readily monitored during 
development by colonoscopy. The combination of colonoscopy with the biopsy of the involved 
tissue has uncovered a number of degenerative genetic pathways leading to the result of a 
malignant tumor. One well studied pathway begins with large polyps in which 60% of the cells 
carry a mutated, activated allele of K-ras. A majority of these tumors then proceed to the 
inactivation-mutation of the gene referred to as the deleted in colon carcinoma (DCC) gene, 
followed by the inactivation of the p53 tumor suppressor gene. 

The DCC gene is a more than approximately one million base pair gene coding for a 190- 
kD transmembrane phosphoprotein which is hypothesized to be a receptor (Weinberg, 1991), the 
loss of which allows the affected cell a growth advantage. It has also been noted that the DCC 
has partial sequence homology to the neural cell adhesion molecule (Marshall, 1991) which 
might suggest a role for the DCC protogene in regulating cell to cell interactions. As can be 
appreciated, the large size and complexity of the DCC gene, together with the complexity of the 
K-ras, p53 and possibly other genes involved in colon cancer tumorigenesis demonstrates a need 
for a broad-spectrum tumor suppressor gene and methods of treating colon carcinoma cells which 
do not depend upon manipulation of the DCC gene or on the identification of other specific 
damaged genes in colon carcinoma cells. 

5. Other Tumor Suppressor Proteins 

Examples of additional tumor suppressor genes and candidate tumor suppressor genes 
contemplated for use in combination with the tumor suppressor genes of the present invention 
include, but are not limited to; the Wilms tumor (WT-1) gene (Call etaL, 1990; Gessler etaL, 
1990; Pritchard- Jones et aL, 1990), the von Hippel-Lindau (VHL) disease tumor suppressor gene 
(Duan etaL, 1995), the Maspin (Zou etaL, 1994), Brush- 1 (Schott etaL, 1994) and BRCA 1 
genes (Miki etaL, 1994; Futreal etaL, 1994) for breast cancer, and the multiple tumor 
suppressor (MTS) or pi 6 gene (Serrano et aL, 1993; Kamb et aL 9 1994). 
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B. DNA Delivery via Infection with Viral Vectors 

In certain embodiments of the invention, the tumor suppressor genes may be stably 
integrated into the genome of the cell. In yet further embodiments, the genes may be stably 
maintained in the cell as a separate, episomal segment of DNA. Such nucleic acid segments or 
"episomes" encode sequences sufficient to permit maintenance or replication independent of or 
in synchronization with the host cell cycle. How the tumor suppressor gene is delivered to a cell 
and where in the cell the nucleic acid remains is dependent on the type of expression vector 
employed. 

1. Adenoviral Vectors 

Preferred for use in the present invention are adenovirus vectors, and particularly 
tetracycline-controlled adenovirus vectors. These vectors may be employed to deliver and 
express a wide variety of genes, including, but not limited to, tumor suppressor genes such as the 
retinoblastoma and p53 genes, in addition to cytokine genes such as tumor necrosis factor a, the 
interferon gene family and the interleukin gene family. 

A preferred method for delivery of the expression constructs involves the use of an 
adenovirus expression vector. Although adenovirus vectors are known to have a low capacity for 
integration into genomic DNA, this feature is counterbalanced by the high efficiency of gene 
transfer afforded by these vectors. "Adenovirus expression vector" is meant to include those 
constructs containing adenovirus sequences sufficient to (a) support packaging of the construct in 
host cells with complementary packaging functions and (b) to ultimately express a heterologous 
gene of interest that has been cloned therein. 

The expression vector comprises a genetically engineered form of adenovirus. 
Knowledge of the genetic organization of adenovirus, a 36 kb, linear, double-stranded DNA 
virus, allows substitution of large pieces of adenoviral DNA with foreign sequences (Grunhaus 
and Horwitz, 1992). In contrast to retrovirus, the adenoviral infection of host cells does not 
result in chromosomal integration because wild-type adenoviral DNA can replicate in an 
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episomal manner without potential genotoxicity. Also, adenoviruses are structurally stable, and 
no genome rearrangement has been detected after extensive amplification. 

Adenovirus is particularly suitable for use as a gene transfer vector because of its mid- 
5 sized genome, ease of manipulation, high titer, wide target-cell range and high infectivity. Both 
ends of the viral genome contain 100-200 base pair inverted repeats (ITRs), which are cis 
elements necessary for viral DNA replication and packaging. The early (E) and late (L) regions 
of the genome contain different transcription units that are divided by the onset of viral DNA 
replication. The El region (El A and E1B) encodes proteins responsible for the regulation of 
10 transcription of the viral genome and a few cellular genes. The expression of the E2 region (E2A 
and E2B) results in the synthesis of the proteins for viral DNA replication. These proteins are 
involved in DNA replication, late gene expression and host cell shut-off (Renan, 1990). The 
y3 products of the late genes, including the majority of the viral capsid proteins, are expressed only 
in j after significant processing of a single primary transcript issued by the major late promoter 
%5 (MLP). The MLP, (located at 16.8 m.u.) is particularly efficient during the late phase of 
lil infection, and all the mRNA's issued from this promoter possess a 5 f -tripartite leader (TPL) 
s sequence which makes them preferred mRNA's for translation. 

p ^ In a current system, recombinant adenovirus is generated from homologous 

20 recombination between a shuttle vector and a master plasmid which contains the backbone of the 
adenovirus genome. Due to the possible recombination between the backbone of the adenovirus 
genome, and the cellular DNA of the helper cells which contain the missing portion of the viral 
genome, wild-type adenovirus may be generated from this process. Therefore, it is critical to 
isolate a single clone of virus from an individual plaque and examine its genomic structure. 

25 

Generation and propagation of most adenovirus vectors, which are replication deficient, 
depend on a unique helper cell line, designated 293, which was transformed from human 
embryonic kidney cells by Ad5 DNA fragments and constitutively expresses El proteins (El A 
and E1B; Graham et aL 9 1977). Since the E3 region is dispensable from the adenovirus genome 
30 (Jones and Shenk, 1978), the current adenovirus vectors, with the help of 293 cells, carry foreign 

-31- 



DNA in either the El, the E3 or both regions (Graham and Prevec, 1991). In nature, adenovirus 
can package approximately 105% of the wild-type genome (Ghosh-Choudhury etal, 1987), 
providing capacity for about 2 extra kb of DNA. Combined with the approximately 5.5 kb of 
DNA that is replaceable in the El and E3 regions, the maximum capacity of most adenovirus 
vectors is at least 7.5 kb, or about 15% of the total length of the vector. More than 80% of the 
adenovirus viral genome remains in the vector backbone. 

Gene transfer in vivo using recombinant El -deficient adenoviruses results in early and 
late viral gene expression that may elicit a host immune response, thereby limiting the duration 
of transgene expression and the use of adenoviruses for gene therapy. In order to circumvent 
these potential problems, the prokaryotic Cre-loxP recombination system has been adapted to 
generate recombinant adenoviruses with extended deletions in the viral genome in order to 
minimize expression of immunogenic and/or cytotoxic viral proteins (Lieber et al, 1996). 

Helper cell lines may be derived from human cells such as human embryonic kidney 
cells, muscle cells, hematopoietic cells or other human embryonic mesenchymal or epithelial 
cells. Alternatively, the helper cells may be derived from the cells of other mammalian species 
that are permissive for human adenovirus. Such cells include, e.g., Vero cells or other monkey 
embryonic mesenchymal or epithelial cells. As stated above, the preferred helper cell line is 293. 

Recently, Racher etal (1995) disclosed improved methods for culturing 293 cells and 
propagating adenovirus. In one format, natural cell aggregates are grown by inoculating 
individual cells into 1 liter siliconized spinner flasks (Techne, Cambridge, UK) containing 100- 
200 ml of medium. Following stirring at 40 rpm, the cell viability is estimated with trypan blue. 
In another format, Fibra-Cel microcarriers (Bibby Sterlin, Stone, UK) (5 g/1) is employed as 
follows. A cell inoculum, resuspended in 5 ml of medium, is added to the carrier (50 ml) in a 
250 ml Erlenmeyer flask and left stationary, with occasional agitation, for 1 to 4 h. The medium 
is then replaced with 50 ml of fresh medium and shaking initiated. For virus production, cells 
are allowed to grow to about 80% confluence, after which time the medium is replaced (to 25% 
of the final volume) and adenovirus added at an MOI of 0.05. Cultures are left stationary 
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overnight, following which the volume is increased to 100% and shaking commenced for another 
72 h. 



In some cases, adenovirus mediated gene delivery to multiple cell types has been found to 
be much less efficient compared to epithelial derived cells. A new adenovirus, AdPK, has been 
constructed to overcome this inefficiency (Wickham etal, 1996). AdPK contains a heparin- 
binding domain that targets the virus to heparan-containing cellular receptors, which are broadly 
expressed in many cell types. Therefore, AdPK delivers genes to multiple cell types at higher 
efficiencies than unmodified adenovirus, thus improving gene transfer efficiency and expanding 
the tissues amenable to efficient adenovirus mediated gene therapy. 

Other than the requirement that the adenovirus vector be replication defective, or at least 
conditionally defective, the nature of the adenovirus vector is not believed to be crucial to the 
successful practice of the invention. The adenovirus may be of any of the 42 different known 
serotypes or subgroups A-F. Adenovirus type 5 of subgroup C is the preferred starting material 
in order to obtain the conditional replication-defective adenovirus vector for use in the present 
invention. This is because Adenovirus type 5 is a human adenovirus about which a great deal of 
biochemical and genetic information is known, and it has historically been used for most 
constructions employing adenovirus as a vector. 

As stated above, the typical vector according to the present invention is replication 
defective and will not have an adenovirus El region. Thus, it will be most convenient to 
introduce the foreign gene expression cassette at the position from which the El -coding 
sequences have been removed. However, the position of insertion of the construct within the 
adenovirus sequences is not critical to the invention. The polynucleotide encoding the gene of 
interest may also be inserted in lieu of the deleted E3 region in E3 replacement vectors as 
described by Karlsson etal (1986) or in the E4 region where a helper cell line or helper virus 
complements the E4 defect (Brough et aL, 1996). 
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Adenovirus growth and manipulation is known to those of skill in the art, and exhibits 
broad host range in vitro and in vivo. This group of viruses can be obtained in high titers, e.g., 
10 9 to 10 11 plaque-forming units per ml, and they are highly infective. The life cycle of 
adenovirus does not require integration into the host cell genome. The foreign genes delivered 
by adenovirus vectors are episomal and, therefore, have low genotoxicity to host cells. No 
severe side effects have been reported in studies of vaccination with wild-type adenovirus 
(Couch et al, 1963; Top etal, 1971), demonstrating their safety and therapeutic potential as in 
vivo gene transfer vectors. 

Adenovirus vectors have been used in eukaryotic gene expression (Levrero etal, 1991; 
Gomez-Foix etal, 1992) and vaccine development (Grunhaus and Horwitz, 1992; Graham and 
Prevec, 1992). Recently, animal studies suggested that recombinant adenovirus could be used 
for gene therapy (Stratford-Perricaudet and Perricaudet, 1991; Stratford-Perricaudet etal, 1991; 
Rich etal, 1993). Studies in administering recombinant adenovirus to different tissues include 
trachea instillation (Rosenfeld etal, 1991; 1992), muscle injection (Ragot etal, 1993), 
peripheral intravenous injections (Herz and Gerard, 1993) and stereotactic inoculation into the 
brain (Le Gal La Salle etal, 1993). Recombinant adenovirus and adeno-associated virus (see 
below) can both infect and transduce non-dividing human primary cells. 

2. AAV Vectors 

Adeno-associated virus (AAV) is also an attractive system for use in construction of 
vectors for delivery of and expression of tumor suppressor genes as it has a high frequency of 
integration and it can infect nondividing cells, thus making it useful for delivery of genes into 
mammalian cells, for example, in tissue culture (Muzyczka, 1992) or in vivo. AAV has a broad 
host range for infectivity (Tratschin etal, 1984; Laughlin etal, 1986; Lebkowski etal, 1988; 
McLaughlin etal, 1988). Details concerning the generation and use of r AAV vectors are 
described in U.S. Patent No. 5,139,941 and U.S. Patent No. 4,797,368, each incorporated herein 
by reference. 
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Studies demonstrating the use of AAV in gene delivery include LaFace et al (1988); 
Zhou et al (1993); Flotte et al (1993); and Walsh et al (1994). Recombinant AAV vectors have 
been used successfully for in vitro and in vivo transduction of marker genes (Kaplitt et al, 1994; 
Lebkowski etaL, 1988; Samulski etaL, 1989; Yoder et al, 1994; Zhou etaL, 1994a; Hermonat 
and Muzyczka, 1984; Tratschin etaL, 1985; McLaughlin etaL, 1988) and genes involved in 
human diseases (Flotte etaL, 1992; Luo etaL, 1994; Ohi etaL, 1990; Walsh etaL, 1994; Wei 
etaL, 1994). Recently, an AAV vector has been approved for phase I human trials for the 
treatment of cystic fibrosis. 

AAV is a dependent parvovirus in that it requires coinfection with another virus (either 
adenovirus or a member of the herpes virus family) to undergo a productive infection in cultured 
cells (Muzyczka, 1992). In the absence of coinfection with helper virus, the wild type AAV 
genome integrates through its ends into human chromosome 19 where it resides in a latent state 
as a pro virus (Kotin etaL, 1990; Samulski etaL, 1991). rAAV, however, is not restricted to 
chromosome 19 for integration unless the AAV Rep protein is also expressed (Shelling and 
Smith, 1994). When a cell carrying an AAV provirus is superinfected with a helper virus, the 
AAV genome is "rescued" from the chromosome or from a recombinant plasmid, and a normal 
productive infection is established (Samulski etaL, 1989; McLaughlin etaL, 1988; Kotin etaL, 
1990; Muzyczka, 1992). 

Typically, recombinant AAV (rAAV) virus is made by cotransfecting a plasmid 
containing the gene of interest flanked by the two AAV terminal repeats (McLaughlin etaL, 
1988; Samulski etaL, 1989; each incorporated herein by reference) and an expression plasmid 
containing the wild type AAV coding sequences without the terminal repeats, for example 
pIM45 (McCarty etaL, 1991; incorporated herein by reference). The cells are also infected or 
transfected with adenovirus or plasmids carrying the adenovirus genes required for AAV helper 
function. rAAV virus stocks made in such fashion are contaminated with adenovirus which must 
be inactivated by heat shock or physically separated from the rAAV particles (for example, by 
cesium chloride density centrifugation). Alternatively, adenovirus vectors containing the AAV 
coding regions or cell lines containing the AAV coding regions and some or all of the adenovirus 
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helper genes could be used (Yang et al 9 1994; Clark et al, 1995). Cell lines carrying the rAAV 
DNA as an integrated pro virus can also be used (Flotte et aL, 1995). 

3. Retroviral Vectors 

In particular aspects of the present invention, delivery of selected genes to target cells 
through the use of retrovirus infection will be desired. The retroviruses are a group of single- 
stranded RNA viruses characterized by an ability to convert their RNA to double-stranded DNA 
in infected cells by a process of reverse-transcription (Coffin, 1990). The resulting DNA then 
stably integrates into cellular chromosomes as a provirus and directs synthesis of viral proteins. 
The integration results in the retention of the viral gene sequences in the recipient cell and its 
descendants. The retroviral genome contains three genes, gag, pol, and env that code for capsid 
proteins, polymerase enzyme, and envelope components, respectively. A sequence found 
upstream from the gag gene contains a signal for packaging of the genome into virions. Two 
long terminal repeat (LTR) sequences are present at the 5' and 3' ends of the viral genome. These 
contain strong promoter and enhancer sequences and are also required for integration in the host 
cell genome (Coffin, 1990). 

In order to construct a retroviral vector, a nucleic acid encoding a gene of interest is 
inserted into the viral genome in the place of certain viral sequences to produce a virus that is 
replication-defective. In order to produce virions, a packaging cell line containing the gag, pol, 
and env genes but without the LTR and packaging components is constructed (Mann etal 9 
1983). When a recombinant plasmid containing a cDNA, together with the retroviral LTR and 
packaging sequences is introduced into this cell line (by calcium phosphate precipitation for 
example), the packaging sequence allows the RNA transcript of the recombinant plasmid to be 
packaged into viral particles, which are then secreted into the culture media (Nicolas and 
Rubenstein, 1988; Temin, 1986; Mann etal, 1983). The media containing the recombinant 
retroviruses is then collected, optionally concentrated, and used for gene transfer. Retroviral 
vectors are able to infect a broad variety of cell types. However, integration and stable 
expression require the division of host cells (Paskind et al 9 1975). 
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Concern with the use of defective retrovirus vectors is the potential appearance of wild- 
type replication-competent virus in the packaging cells. This can result from recombination 
events in which the intact sequence from the recombinant virus inserts upstream from the gag, 
pol, env sequence integrated in the host cell genome. However, new packaging cell lines are 
5 now available that should greatly decrease the likelihood of recombination (Markowitz etal, 
1988; Hersdorffer etal, 1990). 

In some cases, the restricted host-cell range and low titer of retroviral vectors can limit 
their use for stable gene transfer in eukaryotic cells. To overcome these potential difficulties, a 
10 murine leukemia virus-derived vector has been developed in which the retroviral envelope 
glycoprotein has been completely replaced by the G glycoprotein of vesicular stomatitis virus 
(Burns etal, 1993). These vectors can be concentrated to extremely high titers (10 9 colony 
yj forming units/ml), and can infect cells that are ordinarily resistant to infection with vectors 
jfy containing the retroviral envelope protein. These vectors may facilitate gene therapy model 
|5 studies and other gene transfer studies that require direct delivery of vectors in vivo. 

4. Baculoviral Vectors 

STs Baculovirus expression vectors are useful tools for the production of proteins for a variety 

^ of applications (Summers and Smith, 1987; O'Reilly et aL, 1992; also U.S. Patent Nos., 
W 4,745,051 (Smith and Summers), 4,879,236 (Smith and Summers), 5,077,214 (Guarino and 
Jarvis), 5,155,037 (Summers), 5,162,222, (Guarino and Jarvis), 5,169,784 (Summers and Oker- 
Blom) and 5,278,050 (Summers), each incorporated herein by reference). The inventors 
contemplate the construction of baculoviral expression vectors wherein gene expression is 
regulated by tetracycline. These vectors might be particularly useful, for example, where the 
25 desired protein is toxic to the insect cells. In these instances, production of the protein can be 
turned off until the cells have reached a very high density, thereby still allowing for the 
production of large quantities of the desired protein. 

Baculovirus expression vectors are recombinant insect vectors in which the coding region 
30 of a particular gene of interest is placed behind a promoter in place of a nonessential baculoviral 
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gene. The classic approach used to isolate a recombinant baculovirus expression vector is to 
construct a plasmid in which the foreign gene of interest is positioned downstream of the 
polyhedrin promoter. Then, via homologous recombination, that plasmid can be used to transfer 
the new gene into the viral genome in place of the wild-type polyhedrin gene (Summers and 
Smith, 1987; O'Reilly etaL, 1992). 

The resulting recombinant virus can infect cultured lepidopteran insect cells or larvae and 
express the foreign gene under the control of the polyhedrin promoter, which is strong and 
provides very high levels of transcription during the very late phase of infection. The strength of 
the polyhedrin promoter is an advantage of the use of recombinant baculoviruses as expression 
vectors because it usually leads to the synthesis of large amounts of the foreign gene product 
during infection. 

5. Other viral vectors 

Other viral vectors may be employed for construction of expression vectors in the present 
invention. Vectors derived from viruses such as vaccinia virus (Ridgeway, 1988; Baichwal and 
Sugden, 1986; Coupar etaL, 1988), sindbis virus and herpesviruses may be employed. They 
offer several attractive features for various mammalian cells (Friedmann, 1989; Ridgeway, 1988; 
Baichwal and Sugden, 1986; Coupar et al 9 1988; Horwich et al 9 1990). 

With the recent recognition of defective hepatitis B viruses, new insight was gained into 
the structure-function relationship of different viral sequences. In vitro studies showed that the 
virus could retain the ability for helper-dependent packaging and reverse transcription despite the 
deletion of up to 80% of its genome (Horwich et al 9 1990). This suggested that large portions of 
the genome could be replaced with foreign genetic material. Chang etaL (1991) recently 
introduced the chloramphenicol acetyltransferase (CAT) gene into duck hepatitis B virus genome 
in the place of the polymerase, surface, and pre-surface coding sequences. It was cotransfected 
with wild-type virus into an avian hepatoma cell line. Culture media containing high titers of the 
recombinant virus were used to infect primary duckling hepatocytes. Stable CAT gene 
expression was detected for at least 24 days after transfection (Chang etaL, 1991). 
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6. Modified Viruses 

In still further embodiments of the present invention, particularly wherein delivery of a 
selected gene to a specific cell type is desired, the expression constructs to be delivered are 
5 housed within an infective virus that has also been engineered to express a specific binding 
ligand. The virus particle will thus bind specifically to the cognate receptors of the target cell 
and deliver the contents to the cell. A novel approach designed to allow specific targeting of 
retrovirus vectors was recently developed based on the chemical modification of a retrovirus by 
the chemical addition of lactose residues to the viral envelope. This modification can permit the 
10 specific infection of hepatocytes via sialoglycoprotein receptors. 

Another approach to targeting of recombinant retroviruses was designed in which 
'42 biotinylated antibodies against a retroviral envelope protein and against a specific cell receptor 
jn j were used. The antibodies were coupled via the biotin components by using streptavidin (Roux 
%i et ah, 1989). Using antibodies against major histocompatibility complex class I and class II 
^ ■ antigens, they demonstrated the infection of a variety of human cells that bore those surface 
s antigens with an ecotropic virus in vitro (Roux et ah , 1989). 

J C. Other Methods of DNA Delivery 

2© As well as the viral mediated methods of DNA delivery via infection of cells described 

above, other methods of introducing the tumor suppressor genes of the present invention into 
both prokaryotic and eukaryotic cells are contemplated. 

1. Transfection and Transformation 

25 In order to effect expression of a gene construct, the expression construct must be 

delivered into a cell. As described herein, a preferred mechanism for delivery is via viral 
infection, where the expression construct is encapsidated in an infectious viral particle. 
However, several non-viral methods for the transfer of expression constructs into eukaryotic and 
prokaryotic cells also are contemplated by the present invention. In one embodiment of the 

30 present invention, the expression construct may consist only of naked recombinant DNA or 
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plasmids. Transfer of the construct may be performed by any of the methods mentioned which 
physically or chemically permeabilize the cell membrane. 

a. Liposome-Mediated Transfection and Transformation 

5 In a further embodiment of the invention, the expression construct may be entrapped in a 

liposome. Liposomes are vesicular structures characterized by a phospholipid bilayer membrane 
and an inner aqueous medium. Multilamellar liposomes have multiple lipid layers separated by 
aqueous medium. They form spontaneously when phospholipids are suspended in an excess of 
aqueous solution. The lipid components undergo self-rearrangement before the formation of 
10 closed structures and entrap water and dissolved solutes between the lipid bilayers (Ghosh and 
Bachhawat, 1991). Also contemplated is an expression construct complexed with Lipofectamine 
(Gibco BRL). 

fi Liposome-mediated nucleic acid delivery and expression of foreign DNA in vitro has 

tS been very successful (Nicolau and Sene, 1982; Fraley et aL, 1979; Nicolau et al. 9 1987). Wong 
ti l et al (1980) demonstrated the feasibility of liposome-mediated delivery and expression of 
3 foreign DNA in cultured chick embryo, HeLa and hepatoma cells. 

^ In certain embodiments of the invention, the liposome may be complexed with a 

2§ hemagglutinating virus (HVJ). This has been shown to facilitate fusion with the cell membrane 
and promote cell entry of liposome-encapsulated DNA (Kaneda etal^ 1989). In other 
embodiments, the liposome may be complexed or employed in conjunction with nuclear non- 
histone chromosomal proteins (HMG-1) (Kato etal, 1991). In yet further embodiments, the 
liposome may be complexed or employed in conjunction with both HVJ and HMG-1 . 

25 

b. Electroporation 

In certain embodiments of the present invention, the expression construct is introduced 
into the cell via electroporation. Electroporation involves the exposure of a suspension of cells 
and DNA to a high-voltage electric discharge. 

30 
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Transfection of eukaryotic cells using electroporation has been quite successful. Mouse 
pre-B lymphocytes have been transfected with human kappa-immunoglobulin genes (Potter 
etal, 1984), and rat hepatocytes have been transfected with the chloramphenicol 
acetyltransferase gene (Tur-Kaspa et al y 1986) in this manner. 

c. Calcium Phosphate Precipitation or DEAE-Dextran Treatment 

In other embodiments of the present invention, the expression construct is introduced to 
the cells using calcium phosphate precipitation. Human KB cells have been transfected with 
adenovirus 5 DNA (Graham and Van Der Eb, 1973) using this technique. Also in this manner, 
mouse L(A9), mouse C127, CHO, CV-1, BHK, NIH3T3 and HeLa cells were transfected with a 
neomycin marker gene (Chen and Okayama, 1987), and rat hepatocytes were transfected with a 
variety of marker genes (Rippe et al, 1990). 

In another embodiment, the expression construct is delivered into the cell using DEAE- 
dextran followed by polyethylene glycol. In this manner, reporter plasmids were introduced into 
mouse myeloma and erythroleukemia cells (Gopal, 1985). 

d. Particle Bombardment 

Another embodiment of the invention for transferring a naked DNA expression construct 
into cells may involve particle bombardment. This method depends on the ability to accelerate 
DNA-coated microprojectiles to a high velocity allowing them to pierce cell membranes and 
enter cells without killing them (Klein etal, 1987). Several devices for accelerating small 
particles have been developed. One such device relies on a high voltage discharge to generate an 
electrical current, which in turn provides the motive force (Yang etal, 1990). The 
microprojectiles used have consisted of biologically inert substances such as tungsten or gold 
beads. 

e. Direct Microinjection or Sonication Loading 

Further embodiments of the present invention include the introduction of the expression 
construct by direct microinjection or sonication loading. Direct microinjection has been used to 
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introduce nucleic acid constructs into Xenopus oocytes (Harland and Weintraub, 1985), and 

LTK" fibroblasts have been transfected with the thymidine kinase gene by sonication loading 
(Fechheimer et al y 1987). 

5 f. Adenoviral Assisted Transfection 

In certain embodiments of the present invention, the expression construct is introduced 
into the cell using adenovirus assisted transfection. Increased transfection efficiencies have been 
reported in cell systems using adenovirus coupled systems (Kelleher and Vos ? 1994; Cotten 
etaL, 1992; Curiel, 1994). 

10 

g. Receptor Mediated Transfection 

F ,„ Still further expression constructs that may be employed to deliver the construct to the 

y3 target cells are receptor-mediated delivery vehicles. These take advantage of the selective uptake 
ry of macromolecules by receptor-mediated endocytosis that will be occurring in the target cells. In 
%S view of the cell type-specific distribution of various receptors, this delivery method adds a 
U} degree of specificity to the present invention. Specific delivery in the context of another 
» mammalian cell type is described by Wu and Wu (1993; incorporated herein by reference). 



r Z Certain receptor-mediated gene targeting vehicles comprise a cell receptor-specific ligand 

W and a DNA-binding agent. Others comprise a cell receptor-specific ligand to which the DNA 
construct to be delivered has been operatively attached. Several ligands have been used for 
receptor-mediated gene transfer (Wu and Wu, 1987; Wagner etaL, 1990; Perales etaL, 1994; 
Myers, EPO 0273085), which establishes the operability of the technique. In the context of the 
present invention, the ligand will be chosen to correspond to a receptor specifically expressed on 
25 the neuroendocrine target cell population. 

In other embodiments, the DNA delivery vehicle component of a cell-specific gene 
targeting vehicle may comprise a specific binding ligand in combination with a liposome. The 
nucleic acids to be delivered are housed within the liposome and the specific binding ligand is 
30 functionally incorporated into the liposome membrane. The liposome will thus specifically bind 
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to the receptors of the target cell and deliver the contents to the cell. Such systems have been 
shown to be functional using systems in which, for example, epidermal growth factor (EGF) is 
used in the receptor-mediated delivery of a nucleic acid to cells that exhibit upregulation of the 
EGF receptor. 

5 

In still further embodiments, the DNA delivery vehicle component of the targeted 
delivery vehicles may be a liposome itself, which will preferably comprise one or more lipids or 
glycoproteins that direct cell-specific binding. For example, Nicolau et al (1987) employed 
lactosyl-ceramide, a galactose-terminal asialoganglioside, incorporated into liposomes and 
10 observed an increase in the uptake of the insulin gene by hepatocytes. It is contemplated that the 
tissue-specific transforming constructs of the present invention can be specifically delivered into 
the target cells in a similar manner. 

pjlj D. Marker Genes 

IB In certain aspects of the present invention, specific cells are tagged with specific genetic 

U! markers to provide information about the fate of the tagged cells. Therefore, the present 
3 "" invention also provides recombinant candidate screening and selection methods which are based 
zf. upon whole cell assays and which, preferably, employ a reporter gene that confers on its 
I s * recombinant hosts a readily detectable phenotype that emerges only under conditions where a 
2§ general DNA promoter positioned upstream of the reporter gene is functional Generally, 
reporter genes encode a polypeptide (marker protein) not otherwise produced by the host cell 
which is detectable by analysis of the cell culture, e.g., by fluorometric, radioisotopic or 
spectrophotometric analysis of the cell culture. 

25 In other aspects of the present invention, a genetic marker is provided which is detectable 

by standard genetic analysis techniques, such as DNA or RNA amplification by PCR™ or 
hybridization using fluorometric, radioisotopic or spectrophotometric probes. 
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1. Screening 

Exemplary enzymes include esterases, phosphatases, proteases (tissue plasminogen 
activator or urokinase) and other enzymes capable of being detected by their activity, as will be 
known to those skilled in the art. Contemplated for use in the present invention is green 
5 fluorescent protein (GFP) as a marker for transgene expression (Chalfie et aL, 1994). The use of 
GFP does not need exogenously added substrates, only irradiation by near UV or blue light, and 
thus has significant potential for use in monitoring gene expression in living cells. 

Other particular examples are the enzyme chloramphenicol acetyltransferase (CAT) 
10 which may be employed with a radiolabeled substrate, firefly and bacterial luciferase, and the 
bacterial enzymes p-galactosidase and P-glucuronidase. Other marker genes within this class are 
well known to those of skill in the art, and are suitable for use in the present invention. 

SI 2. Selection 

15 Another class of reporter genes which confer detectable characteristics on a host cell are 

til those which encode polypeptides, generally enzymes, which render their transfbrmants resistant 
J against toxins. Examples of this class of reporter genes are the neo gene (Colberre-Garapin 
t;: etal, 1981) which protects host cells against toxic levels of the antibiotic G418, the gene 
R : conferring streptomycin resistance (U. S. Patent 4,430,434), the gene conferring hygromycin B 
|| resistance (Santerre etaL, 1984; U. S. Patents 4,727,028, 4,960,704 and 4,559,302), a gene 
encoding dihydrofolate reductase, which confers resistance to methotrexate (Alt et al, 1978), the 
enzyme HPRT, along with many others well known in the art (Kaufman, 1990). 

E. Biological Functional Equivalents 

25 While the present invention contemplates the use of tumor suppressor proteins, 

exemplified by the retinoblastoma protein, which contain modifications within the N-terminal 
region which confer equal or greater tumor suppression activity on the resultant protein, 
alteration of the unmodified C-terminal portion of the protein such that biological activity is 
maintained also falls within the scope of the present invention. 
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As mentioned above, modification and changes may be made in the structure of, for 
example, the retinoblastoma protein, and still obtain a molecule having like or otherwise 
desirable characteristics. For example, certain amino acids may be substituted for other amino 
5 acids in a protein structure without appreciable loss of tumor suppression activity. Since it is the 
interactive capacity and nature of a protein that defines that protein's biological functional 
activity, certain amino acid sequence substitutions can be made in a protein sequence (or, of 
course, its underlying DNA coding sequence) and nevertheless obtain a protein with like 
(agonistic) properties. Equally, the same considerations may be employed to create a protein or 

10 polypeptide with countervailing (e.g., antagonistic) properties. It is thus contemplated by the 
inventors that various changes may be made in the sequence of tumor suppressor proteins or 
peptides (or underlying DNA) without appreciable loss of their biological utility or activity. 

fTn In terms of functional equivalents, It is also well understood by the skilled artisan that, 

11 inherent in the definition of a biologically functional equivalent protein or peptide, is the concept 
Ul that there is a limit to the number of changes that may be made within a defined portion of the 
* molecule and still result in a molecule with an acceptable level of equivalent biological activity. 

Biologically functional equivalent peptides are thus defined herein as those peptides in which 
H b certain, not most or all, of the amino acids may be substituted. Of course, a plurality of distinct 
|| proteins/peptides with different substitutions may easily be made and used in accordance with 
" the invention. 

It is also well understood that where certain residues are shown to be particularly 
important to the biological or structural properties of a protein or peptide, e.g., residues in active 
25 sites, such residues may not generally be exchanged. 

Conservative substitutions well known in the art include, for example, the changes of: 
alanine to serine; arginine to lysine; asparagine to glutamine or histidine; aspartate to glutamate; 
cysteine to serine; glutamine to asparagine; glutamate to aspartate; glycine to proline; histidine to 
30 asparagine or glutamine; isoleucine to leucine or valine; leucine to valine or isoleucine; lysine to 

-45- 



arginine, glutamine, or glutamate; methionine to leucine or isoleucine; phenylalanine to tyrosine, 
leucine or methionine; serine to threonine; threonine to serine; tryptophan to tyrosine; tyrosine to 
tryptophan or phenylalanine; and valine to isoleucine or leucine. 



5 In making such changes, the hydropathic index of amino acids may be considered. Each 

amino acid has been assigned a hydropathic index on the basis of their hydrophobicity and 
charge characteristics, these are: isoleucine (+4.5); valine (+4.2); leucine (+3.8); phenylalanine 
(+2.8); cysteine/cystine (+2.5); methionine (+1.9); alanine (+1.8); glycine (-0.4); threonine 
(-0.7); serine (-0.8); tryptophan (-0.9); tyrosine (-1.3); proline (-1.6); histidine (-3.2); glutamate 
10 (-3.5); glutamine (-3.5); aspartate (-3.5); asparagine (-3.5); lysine (-3.9); and arginine (-4.5). 

The importance of the hydropathic amino acid index in conferring interactive biological 
yi function on a protein is generally understood in the art (Kyte & Doolittle, 1982, incorporated 
SI herein by reference). It is known that certain amino acids may be substituted for other amino 
f B acids having a similar hydropathic index or score and still retain a similar biological activity. In 
yi making changes based upon the hydropathic index, the substitution of amino acids whose 
J" hydropathic indices are within ±2 is preferred, those which are within ±1 are particularly 
*zi preferred, and those within ±0.5 are even more particularly preferred. 

31 It is also understood in the art that the substitution of like amino acids can be made 

effectively on the basis of hydrophilicity. U.S. Patent 4,554,101, incorporated herein by 
reference, states that the greatest local average hydrophilicity of a protein, as governed by the 
hydrophilicity of its adjacent amino acids, correlates with its immunogenicity and antigenicity, 
i.e. with a biological property of the protein, use this shorter portion for non-immunological 

25 stuff It is understood that an amino acid can be substituted for another having a similar 
hydrophilicity value and still obtain a biologically equivalent, and in particular, an 
immunologically equivalent protein. 

As detailed in U.S. Patent 4,554,101, the following hydrophilicity values have been 
30 assigned to amino acid residues: arginine (+3.0); lysine (+3.0); aspartate (+3.0 ± 1); glutamate 
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(+3.0 ± 1); serine (+0.3); asparagine (+0.2); glutamine (+0.2); glycine (0); threonine (-0.4); 
proline (-0.5 ± 1); alanine (-0.5); histidine (-0.5); cysteine (-1.0); methionine (-1.3); valine (-1.5); 
leucine (-1.8); isoleucine (-1.8); tyrosine (-2.3); phenylalanine (-2.5); tryptophan (-3.4). 

In making changes based upon similar hydrophilicity values, the substitution of amino 
acids whose hydrophilicity values are within ±2 is preferred, those which are within ±1 are 
particularly preferred, and those within ±0.5 are even more particularly preferred. 

While discussion has focused on functionally equivalent polypeptides arising from amino 
acid changes, it will be appreciated that these changes may be effected by alteration of the 
encoding DNA; taking into consideration also that the genetic code is degenerate and that two or 
more codons may code for the same amino acid. Two tables of amino acids and their codons is 
presented below for use in such embodiments, as well as for other uses, such as in the design of 
probes and primers and the like. 
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Table 1 - Preferred Human DNA Codons 



Amino Acids Codons 
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Codon prevalence shown as decreasing from left (most prevalent) to right (least prevalent). 
Underlined codons are those used less than 5 times per one thousand codons. 
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Table 2 - Preferred Human RNA Codons 



Amino Acids Codons 
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Codon prevalence shown as decreasing from left (most prevalent) to right (least prevalent). 
Underlined codons are those used less than 5 times per one thousand codons. 



F. Mutagenesis 

Mutagenesis may be performed in accordance with any of the techniques known in the art 
such as and not limited to synthesizing an oligonucleotide having one or more mutations within 
the sequence of a particular tumor suppressor or cytokine protein. In particular, site-specific 
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mutagenesis is a technique useful in the preparation of individual peptides, or biologically 
functional equivalent proteins or peptides, through specific mutagenesis of the underlying DNA. 
The technique further provides a ready ability to prepare and test sequence variants, for example, 
incorporating one or more of the foregoing considerations, by introducing one or more nucleotide 
5 sequence changes into the DNA. 

Site-specific mutagenesis allows the production of mutants through the use of specific 
oligonucleotide sequences which encode the DNA sequence of the desired mutation, as well as a 
sufficient number of adjacent nucleotides, to provide a primer sequence of sufficient size and 

10 sequence complexity to form a stable duplex on both sides of the deletion junction being 
traversed. Typically, a primer of about 17 to about 75 nucleotides or more in length is preferred, 
with about 10 to about 25 or more residues on both sides of the junction of the sequence being 

J3 altered. 

13 In general, the technique of site-specific mutagenesis is well known in the art, as 

U1 exemplified by various publications. As will be appreciated, the technique typically employs a 
J" phage vector which exists in both a single stranded and double stranded form. Typical vectors 
bri useful in site-directed mutagenesis include vectors such as the Ml 3 phage. These phage are 
H readily commercially available and their use is generally well known to those skilled in the art. 
If Double stranded plasmids are also routinely employed in site directed mutagenesis which 
* & " eliminates the step of transferring the gene of interest from a plasmid to a phage. 

In general, site-directed mutagenesis in accordance herewith is performed by first 
obtaining a single-stranded vector or melting apart of two strands of a double stranded vector 

25 which includes within its sequence a DNA sequence which encodes the desired peptide. An 
oligonucleotide primer bearing the desired mutated sequence is prepared, generally synthetically. 
This primer is then annealed with the single-stranded vector, and subjected to DNA polymerizing 
enzymes such as E. coli polymerase I Klenow fragment, in order to complete the synthesis of the 
mutation-bearing strand. Thus, a heteroduplex is formed wherein one strand encodes the original 

30 non-mutated sequence and the second strand bears the desired mutation. This heteroduplex 
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vector is then used to transform or transfect appropriate cells, such as E. coli cells, and clones are 
selected which include recombinant vectors bearing the mutated sequence arrangement. A 
genetic selection scheme was devised by Kunkel et al (1987) to enrich for clones incorporating 
the mutagenic oligonucleotide. 

5 

Alternatively, the use of PCR™ with commercially available thermostable enzymes such 
as Taq polymerase may be used to incorporate a mutagenic oligonucleotide primer into an 
amplified DNA fragment that can then be cloned into an appropriate cloning or expression 
vector. The PCR™-mediated mutagenesis procedures of Tomic et al (1990) and Upender et al 

10 (1995) provide two examples of such protocols. A PCR™ employing a thermostable ligase in 
addition to a thermostable polymerase may also be used to incorporate a phosphorylated 
mutagenic oligonucleotide into an amplified DNA fragment that may then be cloned into an 

yi appropriate cloning or expression vector. The mutagenesis procedure described by Michael 

SI (1994) provides an example of one such protocol. 

if 

U1 The preparation of sequence variants of the selected peptide-encoding DNA segments 

; " using site-directed mutagenesis is provided as a means of producing potentially useful species 

^ and is not meant to be limiting as there are other ways in which sequence variants of peptides and 

m the DNA sequences encoding them may be obtained. For example, recombinant vectors 

|| encoding the desired peptide sequence may be treated with mutagenic agents, such as 
hydroxylamine, to obtain sequence variants. 

As used herein, the term "oligonucleotide directed mutagenesis procedure" refers to 
template-dependent processes and vector-mediated propagation which result in an increase in the 

25 concentration of a specific nucleic acid molecule relative to its initial concentration, or in an 
increase in the concentration of a detectable signal, such as amplification. As used herein, the 
term "oligonucleotide directed mutagenesis procedure" is intended to refer to a process that 
involves the template-dependent extension of a primer molecule. The term template dependent 
process refers to nucleic acid synthesis of an RNA or a DNA molecule wherein the sequence of 

30 the newly synthesized strand of nucleic acid is dictated by the well-known rules of 
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complementary base pairing (see, for example, Watson, 1987). Typically, vector mediated 
methodologies involve the introduction of the nucleic acid fragment into a DNA or RNA vector, 
the clonal amplification of the vector, and the recovery of the amplified nucleic acid fragment. 
Examples of such methodologies are provided by U.S. Patent 4,237,224, specifically 
5 incorporated herein by reference in its entirety. 

G. Pharmaceutically Acceptable Compositions and Routes of Administration 

Where clinical applications are contemplated, it will be necessary to prepare 
pharmaceutical compositions of the proteins, nucleic acids, including vectors such as 
10 tetracycline-regulated vectors, recombinant viruses and cells in a form appropriate for the 
intended application. Generally, this will entail preparing compositions that are essentially free 
of pyrogens, as well as other impurities that could be harmful to humans or animals. 

«1 One will generally desire to employ appropriate salts and buffers to render the 

15 compositions suitable for introduction into a patient. Aqueous compositions of the present 
Ul invention comprise an effective amount of the therapeutic agent dissolved or dispersed in a 
a " pharmaceutically acceptable carrier or aqueous medium, and preferably encapsulated. The 
«1 phrase "pharmaceutically or pharmacologically acceptable" refer to molecular entities and 
H 5 compositions that do not produce adverse, allergic, or other untoward reactions when 
3§ administered to an animal or a human. As used herein, "pharmaceutically acceptable carrier" 
includes any and all solvents, dispersion media, coatings, antibacterial and antifungal agents, 
isotonic and absorption delaying agents and the like. The use of such media and agents for 
pharmaceutically active substances is well know in the art. Except insofar as any conventional 
media or agent is incompatible with the vectors or cells of the present invention, its use in 
25 therapeutic compositions is contemplated. Supplementary active ingredients, such as other anti- 
cancer agents, can also be incorporated into the compositions. 

Solutions of the active ingredients as free base or pharmacologically acceptable salts can 
be prepared in water suitably mixed with surfactant, such as hydroxypropylcellulose. 
30 Dispersions also can be prepared in glycerol, liquid polyethylene glycols, mixtures thereof and in 
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oils. Under ordinary conditions of storage and use, these preparations contain a preservative to 
prevent growth of microorganisms. Intravenous vehicles include fluid and nutrient replenishers. 
Preservatives include antimicrobial agents, anti-oxidants, chelating agents and inert gases. The 
pH and exact concentration of the various components in the pharmaceutical are adjusted 
5 according to well-known parameters. 

An effective amount of the viruses or cells is determined based on the intended goal. The 
term "unit dose" refers to a physically discrete unit suitable for use in a subject, each unit 
containing a predetermined quantity of the therapeutic composition calculated to produce the 

10 desired response in association with its administration, i.e., the appropriate route and treatment 
regimen. The quantity to be administered, both according to number of treatments and unit dose, 
depends on the subject to be treated, the state of the subject, and the protection desired. Precise 

yi amounts of the therapeutic composition also depend on the judgment of the practitioner and are 

Slj peculiar to each individual. 

1 

U 1 1. Parenteral Administration 

J" The active compositions of the present invention will often be formulated for parenteral 

I s : administration, e.g., formulated for injection via the intravenous, intramuscular, sub-cutaneous, 
¥ h intratumoral, peritumoral or even intraperitoneal routes. The preparation of an aqueous 
|| composition that contains a second agent(s) as active ingredients will be known to those of skill 
in the art in light of the present disclosure. Typically, such compositions can be prepared as 
injectables, either as liquid solutions or suspensions; solid forms suitable for using to prepare 
solutions or suspensions upon the addition of a liquid prior to injection can also be prepared; and 
the preparations can also be emulsified. 

25 

Solutions of the active compounds as free base or pharmacologically acceptable salts can 
be prepared in water suitably mixed with a surfactant, such as hydroxypropylcellulose. 
Dispersions can also be prepared in glycerol, liquid polyethylene glycols, and mixtures thereof 
and in oils. Under ordinary conditions of storage and use, these preparations contain a 
30 preservative to prevent the growth of microorganisms. 
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The pharmaceutical forms suitable for injectable use include sterile aqueous solutions or 
dispersions; formulations including sesame oil, peanut oil or aqueous propylene glycol; and 
sterile powders for the extemporaneous preparation of sterile injectable solutions or dispersions. 
5 In all cases the form must be sterile and must be fluid to the extent that easy syringability exists. 
It must be stable under the conditions of manufacture and storage and must be preserved against 
the contaminating action of microorganisms, such as bacteria and fungi. 

The active compounds may be formulated into a composition in a neutral or salt form. 
10 Pharmaceutically acceptable salts, include the acid addition salts (formed with the free amino 
groups of the protein) and which are formed with inorganic acids such as, for example, 
hydrochloric or phosphoric acids, or such organic acids as acetic, oxalic, tartaric, mandelic, and 
L j3 the like. Salts formed with the free carboxyl groups can also be derived from inorganic bases 
SI such as, for example, sodium, potassium, ammonium, calcium, or ferric hydroxides, and such 
If organic bases as isopropylamine, trimethylamine, histidine, procaine and the like. 

J" The carrier can also be a solvent or dispersion medium containing, for example, water, 

ethanol, polyol (for example, glycerol, propylene glycol, and liquid polyethylene glycol, and the 
h& like), suitable mixtures thereof, and vegetable oils. The proper fluidity can be maintained, for 
21) example, by the use of a coating, such as lecithin, by the maintenance of the required particle size 
w in the case of dispersion and by the use of surfactants. The prevention of the action of 
microorganisms can be brought about by various antibacterial ad antifungal agents, for example, 
parabens, chlorobutanol, phenol, sorbic acid, thimerosal, and the like. In many cases, it will be 
preferable to include isotonic agents, for example, sugars or sodium chloride. Prolonged 
25 absorption of the injectable compositions can be brought about by the use in the compositions of 
agents delaying absorption, for example, aluminum monostearate and gelatin. 

Sterile injectable solutions are prepared by incorporating the active compounds in the 
required amount in the appropriate solvent with various of the other ingredients enumerated 
30 above, as required, followed by filtered sterilization. Generally, dispersions are prepared by 
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incorporating the various sterilized active ingredients into a sterile vehicle which contains the 
basic dispersion medium and the required other ingredients from those enumerated above. In the 
case of sterile powders for the preparation of sterile injectable solutions, the particular methods 
of preparation are vacuum-drying and freeze-drying techniques which yield a powder of the 
5 active ingredient plus any additional desired ingredient from a previously sterile-filtered solution 
thereof. 

For parenteral administration in an aqueous solution, for example, the solution should be 
suitably buffered if necessary and the liquid diluent first rendered isotonic with sufficient saline 
10 or glucose. These particular aqueous solutions are especially suitable for intravenous, 
intramuscular, subcutaneous, intratumoral, peritumoral and intraperitoneal administration. In 
this connection, sterile aqueous media which can be employed will be known to those of skill in 
yi the art in light of the present disclosure. For example, one dosage could be dissolved in 1 ml of 
^1 isotonic NaCl solution and either added to 1000 ml of hypodermoclysis fluid or injected at the 
!§ proposed site of infusion, (see for example, "Remington's Pharmaceutical Sciences" 15th 
yi Edition, pages 1035-1038 and 1570-1580). Some variation in dosage will necessarily occur 
T depending on the condition of the subject being treated. The person responsible for 
!H: administration will, in any event, determine the appropriate dose for the individual subject. 

|| 2. Other Routes of Administration 

In addition to the compounds formulated for parenteral administration, such as 
intravenous or intramuscular injection, other pharmaceutically acceptable forms include, e.g., 
tablets or other solids for oral administration; time release capsules; and any other form currently 
used, including cremes, lotions, mouthwashes, inhalants and the like. 

25 

The expression vectors and delivery vehicles of the present invention may include classic 
pharmaceutical preparations. Administration of these compositions according to the present 
invention will be via any common route so long as the target tissue is available via that route. 
This includes oral, nasal, buccal, rectal, vaginal or topical. Alternatively, administration may be 
30 by orthotopic, intradermal, subcutaneous, intramuscular, intraperitoneal or intravenous injection. 
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The injection can be general, regional, local or direct injection, for example, of a tumor. Also 
contemplated is injection of a resected tumor bed, and continuous perfusion via catheter. Such 
compositions would normally be administered as pharmaceutically acceptable compositions, 
described supra, 

5 

The vectors of the present invention are advantageously administered in the form of 
injectable compositions either as liquid solutions or suspensions; solid forms suitable for solution 
in, or suspension in, liquid prior to injection also may be prepared. These preparations also may 
be emulsified. A typical compositions for such purposes comprises a 50 mg or up to about 100 
10 mg of human serum albumin per milliliter of phosphate buffered saline. Other pharmaceutically 
acceptable carriers include aqueous solutions, non-toxic excipients, including salts, preservatives, 
buffers and the like. Examples of non-aqueous solvents are propylene glycol, polyethylene 
yi glycol, vegetable oil and injectable organic esters, such as theyloleate. Aqueous carriers include 
51 water, alcoholic/aqueous solutions, saline solutions, parenteral vehicles such as sodium chloride, 
fif Ringer's dextrose, etc. Intravenous vehicles include fluid and nutrient replenishers. 
|ji Preservatives include antimicrobial agents, anti-oxidants, chelating agents and inert gases. The 
.7 pH and exact concentration of the various components in the pharmaceutical are adjusted 
bi according to well known parameters. 

|| Additional formulations are suitable for oral administration. Oral formulations include 

* J such typical excipients as, for example, pharmaceutical grades of mannitol, lactose, starch, 
magnesium stearate, sodium saccharine, cellulose, magnesium carbonate and the like. The 
compositions take the form of solutions, suspensions, tablets, pills, capsules, sustained release 
formulations or powders. When the route is topical, the form may be a cream, ointment, salve or 
25 spray. 

An effective amount of the therapeutic agent is determined based on the intended goal. 
The term "unit dose" refers to a physically discrete unit suitable for use in a subject, each unit 
containing a predetermined quantity of the therapeutic composition calculated to produce the 
30 desired response in association with its administration, Le. 9 the appropriate route and treatment 
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regimen. The quantity to be administered, both according to number of treatments and unit dose, 
depends on the subject to be treated, the state of the subject and the protection desired. Precise 
amounts of the therapeutic composition also depend on the judgment of the practitioner and are 
peculiar to each individual. 

5 

In certain cases, the therapeutic formulations of the invention could also be prepared in 
forms suitable for topical administration, such as in cremes and lotions. These forms may be 
used for treating skin-associated diseases, such as various sarcomas. 

10 Upon formulation, solutions will be administered in a manner compatible with the dosage 

formulation and in such amount as is therapeutically effective. The formulations are easily 
administered in a variety of dosage forms, such as the type of injectable solutions described 

y:| above, with even drug release capsules and the like being employable. 

fS EL Chemotherapeutic Agents 

If! The methods of the present invention may be combined with any other methods generally 

J employed in the treatment of the particular disease or disorder that the patient exhibits. For 
si example, in connection with the treatment of solid tumors, the methods of the present invention 
^ may be used in combination with classical approaches, such as surgery, radiotherapy and the 
W like. So long as a particular therapeutic approach is not known to be detrimental in itself, or 
counteracts the effectiveness of the tumor suppressor therapy, its combination with the present 
invention is contemplated. When one or more agents are used in combination with cytokine gene 
therapy and/or tumor suppressor gene therapy, there is no requirement for the combined results 
to be additive of the effects observed when each treatment is conducted separately, although this 
25 is evidently desirable, and there is no particular requirement for the combined treatment to 
exhibit synergistic effects, although this is certainly possible and advantageous. 

In terms of surgery, any surgical intervention may be practiced in combination with the 
present invention. In connection with radiotherapy, any mechanism for inducing DNA damage 
30 locally within tumor cells is contemplated, such as y-irradiation, X-rays, UV-irradiation, 
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microwaves and even electronic emissions and the like. The directed delivery of radioisotopes to 
tumor cells is also contemplated, and this may be used in connection with a targeting antibody or 
other targeting means. Cytokine therapy also has proven to be an effective partner for combined 
therapeutic regimens. Various cytokines may be employed in such combined approaches. 

5 Examples of cytokines include IL-la IL-lp, IL-2, IL-3, IL-4, IL-5, IL-6, IL-7, IL-8, IL-9, IL-10, 
IL-11, IL-12, IL-13, TGF-p, GM-CSF, M-CSF, G-CSF, TNFa, TNFp, LAF, TCGF, BCGF, 
TRF, BAF, BDG, MP, LIF, OSM, TMF, PDGF, IFN-a, IFN-p, IFN-y. Cytokines are 
administered according to standard regimens, consistent with clinical indications such as the 
condition of the patient and relative toxicity of the cytokine. Below is an exemplary, but in no 

10 way limiting, table of cytokine genes contemplated for use in certain embodiments of the present 
invention. 



Table 3 





Cytokine 


Reference 




human IL-la 


March et al, Nature, 315:641, 1985 




murine IL-la 


Lomedico et al, Nature, 312:458, 1984 




human IL-ip 


March et al, Nature, 315:641, 1985; Auron et al, Proc. Natl Acad. 




Set USA, 81:7907, 1984 




murine IL-1(3 


Gray, J. Immunol, 137:3644, 1986; Telford, Nucl Acids Res., 
14:9955, 1986 




human IL-lra 


Eisenberg et al, Nature, 343:341, 1990 




human IL-2 


Taniguchi et al, Nature, 302:305, 1983; Maeda et al, Biochem. 
Biophys. Res. Commun., 115:1040, 1983 




human IL-2 


Taniguchi et al, Nature, 302:305, 1983 




human IL-3 


Yang et al, Cell, 47:3, 1986 




murine IL-3 


Yokota et al,Proc. Natl. Acad. Sci. USA, 81:1070, 1984; Fung et 
al, Nature, 307:233, 1984; Miyatake et al,Proc. Natl Acad. Sci. 
USA, 82:316, 1985 




human IL-4 


Yokota et al, Proc. Natl Acad. Sci. USA, 83:5894, 1986 




murine IL-4 


Norma et al, Nature, 319:640, 1986; Lee et al, Proc. Natl. Acad. 
Sci. USA, 83:2061, 1986 




human IL-5 


Azuma etal, Nucl Acids Res., 14:9149, 1986 




murine IL-5 


Kinashi et al, Nature, 324:70, 1986; Mizuta et al, Growth Factors, 
1:51, 1988 




human IL-6 


Hirano et al, Nature, 324:73, 1986 




murine IL-6 


Van Snick et al, Eur. J. Immunol, 18:193, 1988 




human IL-7 


Goodwin etal, Proc. Natl. Acad. Sci. USA, 86:302, 1989 
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Cytokine 


Reference 


XT ^7 

murine 1L- / 


iNamen at. , Nature, jjjj / i, i^oo 


human IL-8 


oCnmiU £f t7. ImmUrlOl. , 1 J7.ZjU, 1;70/, IvlcUsUMllllla Ut., tA 

r Yn /UW 167*1Rfn 1 Q8R* T inrHev /?/ Pror Afar// Acad Sci 
fT^/l 85*9199 1988 


numdn uu-y 


Pptianlri Pt nl T Jmmimnl 144*4915 1990 

IvCllclU.HJ.c-/ ui. 7 t/. inifflHH\Jl., m.iLJJ, i y y\J 


murine ll,-? 


RpnmilH ^/ /?/ / Immunol 144*4235 1990 

XVClldllllJ. t>l Ctt., i/. lfflfflUflUt., lT^.T^J.J, LJJ\J 


numan /\ngiogenm 


TCurnrhi Pt nl Rinrhpmi\trv 94*5494 1985 


human GROa 


Richmond 0/., EMBOJ., 7:2025, 1988 


murine MIP-loc 


r>av5itf*1iQ ^ /7/ f F Y n 167-1Q^Q 1988 


murine MIP-lp 


Sherry era/., J. £xp. Med, 168:2251, 1988 


human MIF 


Weiser etal.,Proc. Natl. Acad. Sci. USA, 86:7522, 1989 


human G-CSF 


JNagata a/., Nature, 319.41j, l^oo, oouza er a/. , science, zjz.oi, 


human GM-CSF 


Cantrell et al,Proc. Natl. Acad Sci. USA, 82:6250, 1985; Lee et 
al. 9 Proc. Natl. Acad. Sci. USA, 5z:43oU, 19oj, Wong era/., 
Science, 228:810, 1985 


murine GM-CSF 


Gough et al. , EMBO J. , 4:645, 1 985 


human M-CSF 


Wong, Science, 235:1504, 1987; Kawasaki, Science, 23U;291, 
1985; Ladner, EMBO J. , 6:2693, 1987 


human EGF 


Smith et al.,Nucl. Acids Res., 10:4467, 1982; Bell et al.,Nucl. 
Acids Res., 14:o4z/ ? lyoo 


1 T S~~\ 

human TGF-a 


Derynck e/a/., Ce//, 35:z5/, 19J54 


human FGF acidic 


Jaye ef a/., Science, 233:541, 1986; Gimenez-Gallego et al, 
Biocnem. Biopnys. Res. Lommun., l Jo. oil, lyoo, riarper er 
Biochem., 25:4097, 1986 


human p-ECGF 


Jaye ef «/., Science, 233:541, lyoo 


human FGF basic 


Abraham a/., EMBOJ., 5:2523, 1986; Sommer e? a/., Biochem. 

Biopnys. Res. Comm., 144. j4J, IVo/ 


murine IFN-P 


Higashi ^ a/., J. Ctem., 258:9522, 1983; Kuga, Nucl Acids 
Res., 1 /:iz91, 19o9 


human IFN-y 


Gray aL, Nature, 295:503, 19oz, Uevos al.,NucL Acids Kes., 
10:246/, 19oz; KinaerKnecnt, J. Jtswl. cnem. , Zjy.o fy\), iyo4 


human lur -1 


Jansen al., Nature, Juo.ouy, i^oJ, Kotwem ex ai., j. nioi. \^nem., 
261:4828, 1986 


human lur-11 


rJell et al. , Nature, d l u . / / j , i Vo4 


human P-NGF chain 


unricn et ai., Nature, jvd.ozl, i^oj 


human PDGF A chain 


Betsholtz et al, Nature, 320:695, 1986 


numan r jj<jt r> cnain 


jonnsson ei ai. , ejivil>\j j. , j.7Zi, 1^0*+, v^uimib ui., i\uiurv, 
316:748, 1985 


human TGF-pl 


Derynck a/., Nature, 316:701, 1985 


human TNF-a 


Pennica et al, Nature, 312:724, 1984; Fransen et al.,Nucl. Acids 
Res., 13:4417, 1985 
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Cytokine 


Reference 


human TNF-p 


Qayetal 9 Nature 9 3l2:72l 9 1984 


murine TNF-fJ 


Gray et al 9 Nucl Acids Res. 9 15:3937, 1987 



Compositions of the present invention can have an effective amount of an engineered 
virus or cell for therapeutic administration in combination with an effective amount of a 
compound (second agent) that is a chemotherapeutic agent as exemplified below. Such 
5 compositions will generally be dissolved or dispersed in a pharmaceutically acceptable carrier or 
aqueous medium. A wide variety of chemotherapeutic agents may be used in combination with 
the therapeutic genes of the present invention. These can be, for example, agents that directly 
cross-link DNA, agents that intercalate into DNA, and agents that lead to chromosomal and 
mitotic aberrations by affecting nucleic acid synthesis. 

C Irrespective of the mechanisms by which the enhanced tumor destruction is achieved, the 

m combined treatment aspects of the present invention have evident utility in the effective 
1- treatment of disease. To use the compositions of the present invention in combination with the 
l i* administration of a chemotherapeutic agent, one would simply administer to an animal at least a 
15 first modified retinoblastoma tumor suppressor as disclosed herein in combination with the 
f ; j chemotherapeutic agent in a manner effective to result in their combined anti-tumor actions 
C within the animal. These agents would therefore be provided in an amount effective and for a 
■O period of time effective to result in their combined presence and their combined actions in the 

tumor environment. To achieve this goal, the modified retinoblastoma tumor suppressor and 
20 chemotherapeutic agents may be administered to the animal simultaneously, either in a single 

composition or as two distinct compositions using different administration routes. 

Alternatively, the modified retinoblastoma tumor suppressor treatment may precede or 
follow the chemotherapeutic agent treatment by intervals ranging from minutes to weeks. In 
25 embodiments where the chemotherapeutic factor and modified retinoblastoma tumor suppressor 
are applied separately to the animal, one would generally ensure that a significant period of time 
did not expire between the time of each delivery, such that the chemotherapeutic agent and 
modified retinoblastoma tumor suppressor composition would still be able to exert an 
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advantageously combined effect on the tumor. In such instances, it is contemplated that one 
would contact the tumor with both agents within about 5 minutes to about one week of each 
other and, more preferably, within about 12-72 hours of each other, with a delay time of only 
about 12-48 hours being most preferred. In some situations, it may be desirable to extend the 
5 time period for treatment significantly, where several days (2, 3, 4, 5, 6 or 7) or even several 
weeks (1, 2, 3, 4, 5, 6, 7 or 8) lapse between the respective administrations. It also is conceivable 
that more than one administrations of either the modified retinoblastoma tumor suppressor or the 
chemotherapeutic agent will be desired. To achieve tumor regression, both agents are delivered 
in a combined amount effective to inhibit its growth, irrespective of the times for administration. 

10 

A variety of chemotherapeutic agents are intended to be of use in the combined treatment 
methods disclosed herein. Chemotherapeutic agents contemplated as exemplary include, e.g., 
yj etoposide (VP- 16), adriamycin, 5-fluorouracil (5FU), camptothecin, actinomycin-D, 
SI mitomycin C, cisplatin (CDDP) and even hydrogen peroxide. 

8 

U1 As will be understood by those of ordinary skill in the art, the appropriate doses of 

7 chemotherapeutic agents will be generally around those already employed in clinical therapies 
SI wherein the chemotherapeutics are administered alone or in combination with other 
^ chemotherapeutics. By way of example only, agents such as cisplatin, and other DNA alkylating 
30 may be used. Cisplatin has been widely used to treat cancer, with efficacious doses used in 

clinical applications of 20 mg/m 2 for 5 days every three weeks for a total of three courses. 

Cisplatin is not absorbed orally and must therefore be delivered via injection intravenously, 

subcutaneously, intratumorally or intraperitoneally. 

25 Agents that directly cross-link nucleic acids, specifically DNA, are envisaged and are 

shown herein, to eventuate DNA damage leading to a synergistic antineoplastic combination. 
Agents such as cisplatin, and other DNA alkylating agents may be used. 

Further useful agents include compounds that interfere with DNA replication, mitosis and 
30 chromosomal segregation. Such chemotherapeutic compounds include adriamycin, also known 
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as doxorubicin, etoposide, verapamil, podophyllotoxin, and the like. Widely used in a clinical 
setting for the treatment of neoplasms, these compounds are administered through bolus 
injections intravenously at doses ranging from 25-75 mg/m at 21 day intervals for adriamycin, 
to 35-50 mg/m for etoposide intravenously or double the intravenous dose orally. 

Agents that disrupt the synthesis and fidelity of polynucleotide precursors may also be 
used. Particularly useful are agents that have undergone extensive testing and are readily 
available. As such, agents such as 5-fluorouracil (5-FU) are preferentially used by neoplastic 
tissue, making this agent particularly useful for targeting to neoplastic cells. Although quite 
toxic, 5-FU, is applicable in a wide range of carriers, including topical, however intravenous 
administration with doses ranging from 3 to 15 mg/kg/day being commonly used. 

Plant alkaloids such as taxol are also contemplated for use in certain aspects of the 
present invention. Taxol is an experimental antimitotic agent, isolated from the bark of the ash 
tree, Taxus brevifolia. It binds to tubulin (at a site distinct from that used by the vinca alkaloids) 
and promotes the assembly of microtubules. Taxol is currently being evaluated clinically; it has 
activity against malignant melanoma and carcinoma of the ovary. Maximal doses are 30 mg/m 2 
per day for 5 days or 210 to 250 mg/m given once every 3 weeks. Of course, all of these dosages 
are exemplary, and any dosage in-between these points is also expected to be of use in the 
invention. 

Exemplary chemotherapeutic agents that are useful in connection with combined therapy 
are listed in Table 4. Each of the agents listed therein are exemplary and by no means limiting. 
The skilled artisan is directed to "Remington's Pharmaceutical Sciences" 15th Edition, chapter 
33, in particular pages 624-652. Some variation in dosage will necessarily occur depending on 
the condition of the subject being treated. The person responsible for administration will, in any 
event, determine the appropriate dose for the individual subject. Moreover, for human 
administration, preparations should meet sterility, pyrogenicity, general safety and purity 
standards as required by FDA Office of Biologies standards. 
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Table 4 

Chemotherapeutic Agents Useful In Neoplastic Disease 



Class 


Type Of Agent 


Nonproprietary Names 
[Utner Names) 


Disease 


Alkylating Agents 


Nitrogen Mustards 


Mechlorethamine (HN 2 ) 


Hodgkin's disease, non-Hodgkin's lymphomas 


Cyclophosphamide 
Ifosfamide 


Acute and chronic lymphocytic leukemias, 
Hodgkin's disease, non-Hodgkin's lymphomas, 
multiple myeloma, neuroblastoma, breast, ovary, 
lung, Wilms' tumor, cervix, testis, soft-tissue 
sarcomas 


El/I alnnolon /l CQrrn \rcinl 
IvieipnalaM \L"dai UUiyoill/ 


Multiple myeloma, breast, ovary 


Chlorambucil 


Chronic lymphocytic leukemia, primary 
macroglobulinemia, Hodgkin's disease, non- 
Hodgkin's lymphomas 


Ethylenimenes and 
Methylmelamines 


Hexamethylmelamine 


Ovary 


Thiotepa 


Bladder, breast, ovary 


Alkyl Sulfonates 


Busulfan 


Chronic granulocytic leukemia 


Nitrosoureas 


Carmustine (BCNU) 


Hodgkin's disease, non-Hodgkin's lymphomas, 
primary brain tumors, multiple myeloma, 
malignant melanoma 


Lomustine (CCNU) 


Hodgkin's disease, non-Hodgkin's lymphomas, 
primary brain tumors, small-cell lung 


Semustine (methyl-CCNU) 


Primary brain tumors, stomach, colon 


Streptozocin 
(streptozotocin) 


Malignant pancreatic insulinoma, malignant 
carcinoid 


Triazines 


Dacarbazine (DTIC; 

riiinpth\/ltria7Pnnimififi7 

olecarboxamide) 


Malignant melanoma, Hodgkin's disease, soft- 
tissue sarcomas 


Antimetabolites 


Folic Acid Analogs 


Methotrexate 
(amethopterin) 


Arntp Ivmnhnrvtir Ipiikpmia rhnrinrarrinnma 

rll/UlG 1 y II IfJIIUOy III* ICUrVCIlllO, LrllUI lUbul UtIIUll lu, 

mycosis fungoides, breast, head and neck, lung, 
osteogenic sarcoma 


Pyrimidine Analogs 


Fiimiirari! /R-flfinrnnraril* 
riuuuidLii \%j i iuui uui dLii, 

5-FU) 
Floxuridine (fluorode- 
oxyuridine; FUdR) 


Breast, colon, stomach, pancreas, ovary, head 
and neck, urinary bladder, premalignant skin 
lesions (topical) 


Cytarabine (cytosine 
arabinoside) 


Acute granulocytic and acute lymphocytic 
leukemias 


Purine Analogs and 
Related Inhibitors 


Mercaptopurine 
(6-mercaptopurine; 
6-MP) 


Acute lymphocytic, acute granulocytic and 
chronic granulocytic leukemias 


Thioguanine 

(6-thioguanine; TG) 


Acute granulocytic, acute lymphocytic and 
chronic granulocytic leukemias 
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Class 


Type Of Agent 


Nonproprietary Names 
(Other Names) 


Disease 






Pentostatin 
(2-deoxycoformycin) 


Hairy cell leukemia, mycosis fungoides, chronic 
lymphocytic leukemia 


Natural Products 


Vinca Alkaloids 


Vinblastine (VLB) 

V 11 IUIUO 111 IG \ILU| 


Hodgkin's disease, non-Hodgkin's lymphomas, 
breast, testis 


Vincristine 


Acute lymphocytic leukemia, neuroblastoma, 
Wilms' tumor, rhabdomyosarcoma, Hodgkin's 
disease, non-Hodgkin's lymphomas, small-cell 
lung 


Epipodophyllotoxins 


Etoposide (VP16) 
Tertiposide 


Testis, small-cell lung and other lung, breast, 
Hodgkin's disease, non-Hodgkin's lymphomas, 
acute granulocytic leukemia, Kaposi's sarcoma 


Antibiotics 
Antibiotics, continued 


Dactinomycin 
(actinomycin D) 


Choriocarcinoma, Wilms' tumor, 
rhabdomyosarcoma, testis, Kaposi's sarcoma 


Daunorubicin 
(daunomycin; 
rubidomycin) 


Acute granulocytic and acute lymphocytic 

Iplllf pmiac 
ICUnCII HuO 


Doxorubicin 


Soft-tissue, osteogenic and other sarcomas; 
Hodgkin's disease, non-Hodgkin's lymphomas, 
acute leukemias, breast, genitourinary, thyroid, 
lung, stomach, neuroblastoma 


Bleomycin 


Testis, head and neck, skin, esophagus, lung and 
genitourinary tract; Hodgkin's disease, non- 

■ ■ i ■ * r i l 

Hodgkm s lymphomas 


Plicamycin (mithramycin) 


Testis, malignant hypercalcemia 


Mitomycin (mitomycin C) 


Stomach, cervix, colon, breast, pancreas, 
bladder, head and neck 


Enzymes 


L-Asparaginase 


Acute lymphocytic leukemia 


Biological Response 
Modifiers 


Interferon alfa 


Hairy cell leukemia., Kaposi's sarcoma, 
melanoma, carcinoid, renal cell, ovary, bladder, 
non-Hodgkin's lymphomas, mycosis fungoides, 
multiple myeloma, chronic granulocytic leukemia 


Miscellaneous 
Agents 


Platinum Coordination 
Complexes 


Cisplatin fcvs-DDP) 
Carboplatin 


Testis, ovary, bladder, head and neck, lung, 
thyroid, cervix, endometrium, neuroblastoma, 
osteogenic sarcoma 


Anthracenedione 


Mitoxantrone 


Acute granulocytic leukemia, breast 


Substituted Urea 


Hydroxyurea 


Chronic granulocytic leukemia, polycythemia 
vera, essental thrombocytosis, malignant 
melanoma 


Methyl Hydrazine 
Derivative 


Procarbazine 
(N-methylhydrazine, 
MIH) 


Hodgkin's disease 


Adrenocortical 
Suppressant 


Mitotane (<y>'-DDD) 


Adrenal cortex 


Aminoglutethimide 


Breast 
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Class 


Type Of Agent 


Nonproprietary Names 
(Other Names) 


Disease 


Hormones and 
Antagonists 


Adrenocorticosteroids 


Prednisone (several other 
equivalent preparations 
available) 


Acute and chronic lymphocytic leukemias, non- 
Hodgkin's lymphomas, Hodgkin's disease, breast 


Progestins 


Hydroxyprogesterone 

caproate 
Medroxyprogesterone 

acetate 
Megestrol acetate 


Endometrium, breast 


Estrogens 


Diethylstilbestrol 
Ethinyl estradiol (other 
preparations available) 


Breast, prostate 


Antiestrogen 


Tamoxifen 


Breast 


Annrnnnno 

Miiuruytins 


Testosterone propionate 
Fluoxymesterone (other 
preparations available) 


Breast 


Antiandrogen 


Flutamide 


Prostate 


Gonadotropin-releasing 
hormone analog 


Leuprolide 


Prostate 



I. Protein Purification 

Certain aspects of the present invention concern the purification, and in particular 
embodiments, the substantial purification, of an encoded protein or peptide. The term "purified 
protein or peptide " as used herein, is intended to refer to a composition, isolatable from other 
components, wherein the protein or peptide is purified to any degree relative to its naturally- 
obtainable state. A purified protein or peptide therefore also refers to a protein or peptide, free 
from the environment in which it may naturally occur. 

Generally, "purified" will refer to a protein or peptide composition that has been 
subjected to fractionation to remove various other components, and which composition 
substantially retains its expressed biological activity. Where the term "substantially purified" is 
used, this designation will refer to a composition in which the protein or peptide forms the major 
component of the composition, such as constituting about 50% or more of the proteins in the 
composition. 
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Various methods for quantifying the degree of purification of the protein or peptide will 
be known to those of skill in the art in light of the present disclosure. These include, for 
example, determining the specific activity of an active fraction, or assessing the number of 
polypeptides within a fraction by SDS/PAGE analysis. A preferred method for assessing the 
purity of a fraction is to calculate the specific activity of the fraction, to compare it to the specific 
activity of the initial extract, and to thus calculate the degree of purity, herein assessed by a "- 
fold purification number". The actual units used to represent the amount of activity will, of 
course, be dependent upon the particular assay technique chosen to follow the purification and 
whether or not the expressed protein or peptide exhibits a detectable activity. 

Various techniques suitable for use in protein purification will be well known to those of 
skill in the art. These include, for example, precipitation with ammonium sulphate, PEG, 
antibodies and the like or by heat denaturation, followed by centrifugation; chromatography steps 
such as ion exchange, gel filtration, reverse phase, hydroxylapatite and affinity chromatography; 
isoelectric focusing; gel electrophoresis; and combinations of such and other techniques. As is 
generally known in the art, it is believed that the order of conducting the various purification 
steps may be changed, or that certain steps may be omitted, and still result in a suitable method 
for the preparation of a substantially purified protein or peptide. 

There is no general requirement that the protein or peptide always be provided in their 
most purified state. Indeed, it is contemplated that less substantially purified products will have 
utility in certain embodiments. Partial purification may be accomplished by using fewer 
purification steps in combination, or by utilizing different forms of the same general purification 
scheme. For example, it is appreciated that a cation-exchange column chromatography 
performed utilizing an HPLC apparatus will generally result in a greater -fold purification than 
the same technique utilizing a low pressure chromatography system. Methods exhibiting a lower 
degree of relative purification may have advantages in total recovery of protein product, or in 
maintaining the activity of an expressed protein. 
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It is known that the migration of a polypeptide can vary, sometimes significantly, with 
different conditions of SDS/PAGE (Capaldi etaL, 1977). It will therefore be appreciated that 
under differing electrophoresis conditions, the apparent molecular weights of purified or partially 
purified expression products may vary. 

High Performance Liquid Chromatography (HPLC) is characterized by a very rapid 
separation with extraordinary resolution of peaks. This is achieved by the use of very fine 
particles and high pressure to maintain and adequate flow rate. Separation can be accomplished 
in a matter of minutes, or a most an hour. Moreover, only a very small volume of the sample is 
needed because the particles are so small and close-packed that the void volume is a very small 
fraction of the bed volume. Also, the concentration of the sample need not be very great because 
the bands are so narrow that there is very little dilution of the sample. 

Gel chromatography, or molecular sieve chromatography, is a special type of partition 
chromatography that is based on molecular size. The theory behind gel chromatography is that 
the column, which is prepared with tiny particles of an inert substance that contain small pores, 
separates larger molecules from smaller molecules as they pass through or around the pores, 
depending on their size. As long as the material of which the particles are made does not adsorb 
the molecules, the sole factor determining rate of flow is the size. Hence, molecules are eluted 
from the column in decreasing size, so long as the shape is relatively constant. Gel 
chromatography is unsurpassed for separating molecules of different size because separation is 
independent of all other factors such as pH, ionic strength, temperature, etc. There also is 
virtually no adsorption, less zone spreading and the elution volume is related in a simple matter 
to molecular weight. 

Affinity chromatography is a chromatographic procedure that relies on the specific 
affinity between a substance to be isolated and a molecule that it can specifically bind to. This is 
a receptor-ligand type interaction. The column material is synthesized by covalently coupling 
one of the binding partners to an insoluble matrix. The column material is then able to 
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specifically adsorb the substance from the solution. Elution occurs by changing the conditions to 
those in which binding will not occur (alter pH, ionic strength, temperature, etc.), 

A particular type of affinity chromatography useful in the purification of carbohydrate 
containing compounds is lectin affinity chromatography. Lectins are a class of substances that 
bind to a variety of polysaccharides and glycoproteins. Lectins are usually coupled to agarose by 
cyanogen bromide. Conconavalin A coupled to Sepharose was the first material of this sort to be 
used and has been widely used in the isolation of polysaccharides and glycoproteins other lectins 
that have been include lentil lectin, wheat germ agglutinin which has been useful in the 
purification of N-acetyl glucosaminyl residues and Helix pomatia lectin. Lectins themselves are 
purified using affinity chromatography with carbohydrate ligands. Lactose has been used to 
purify lectins from castor bean and peanuts; maltose has been useful in extracting lectins from 
lentils and jack bean; N-acetyl-D galactosamine is used for purifying lectins from soybean; N- 
acetyl glucosaminyl binds to lectins from wheat germ; D-galactosamine has been used in 
obtaining lectins from clams and L-fucose will bind to lectins from lotus. 

The matrix should be a substance that itself does not adsorb molecules to any significant 
extent and that has a broad range of chemical, physical and thermal stability. The ligand should 
be coupled in such a way as to not affect its binding properties. The ligand should also provide 
relatively tight binding. And it should be possible to elute the substance without destroying the 
sample or the ligand. One of the most common forms of affinity chromatography is 
immunoaffinity chromatography. 

L. Use of Cells in Bioreactors 

The ability to produce biologically active polypeptides is increasingly important to the 
pharmaceutical industry. The present invention discloses compositions and methods for the 
efficient regulated expression of, for example, tumor suppressor genes in cells, allowing for the 
production of these proteins in vitro from previously refractory cell types. 
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Over the last decade, advances in biotechnology have led to the production of important 
proteins and factors from bacteria, yeast, insect cells and from mammalian cell culture. 
Mammalian cultures have advantages over cultures derived from the less advanced lifeforms in 
their ability to post-translationally process complex protein structures such as disulfide- 
5 dependent folding and glycosylation. Indeed, mammalian cell culture is now the preferred 
source of a number of important proteins for use in human and animal medicine, especially those 
which are relatively large, complex or glycosylated. 

Development of mammalian cell culture for production of pharmaceuticals has been 
10 greatly aided by the development in molecular biology of techniques for design and construction 

of vector systems highly efficient in mammalian cell cultures, a battery of useful selection 
^ markers, gene amplification schemes and a more comprehensive understanding of the 

biochemical and cellular mechanisms involved in procuring the final biologically-active 
pi molecule from the introduced vector. 

SI 

JtJ However, the traditional selection of cell types for expressing heterologous proteins has 

generally been limited to the more "common" cell types such as CHO cells, BHK cells, CI 27 
jhj cells and myeloma cells. In many cases, these cell types were selected because there was a great 
^ deal of preexisting literature on the cell type or the cell was simply being carried in the 
10 laboratory at the time the effort was made to express a peptide product. Frequently, factors 
which affect the downstream (e.g., beyond the T-75 flask) side of manufacturing scale-up were 
not considered before selecting the cell line as the host for the expression system. 

Aspects of the present invention take advantage of the biochemical and cellular capacities 
25 of mammalian cells as well as of recently available bioreactor technology. Growing cells 
according to the present invention in a bioreactor allows for large scale production and secretion 
of complex, fully biologically-active polypeptides into the growth media. In particular 
embodiments, by designing a defined media with low contents of complex proteins and using a 
scheme of timed-stimulation of the secretion into the media for increased titer, the purification 
30 strategy can be greatly simplified, thus lowering production cost. 
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1. Anchorage-dependent and non-anchorage-dependent cultures. 

Animal and human cells can be propagated in vitro in two modes: as non-anchorage 
dependent cells growing freely in suspension throughout the bulk of the culture; or as anchorage- 
5 dependent cells requiring attachment to a solid substrate for their propagation (Le., a monolayer 
type of cell growth). 

Non-anchorage dependent or suspension cultures from continuous established cell lines 
are the most widely used means of large scale production of cells and cell products. Large scale 

10 suspension culture based on microbial (bacterial and yeast) fermentation technology has clear 
advantages for the manufacturing of mammalian cell products. The processes are relatively 

^ straightforward to operate and scale up. Homogeneous conditions can be provided in the reactor 
y 3 which allows for precise monitoring and control of temperature, dissolved oxygen, and pH, and 

11 j ensure that representative samples of the culture can be taken. 

However, suspension cultured cells cannot always be used in the production of 
s biologicals. Suspension cultures are still considered to have tumorigenic potential and thus their 
fT; use as substrates for production put limits on the use of the resulting products in human and 
^ veterinary applications (Petricciani, 1985; Larsson and Litwin, 1987). Viruses propagated in 
m suspension cultures as opposed to anchorage-dependent cultures can sometimes cause rapid 
changes in viral markers, leading to reduced immunogenicity (Bahnemann, 1980). Finally, 
sometimes even recombinant cell lines can secrete considerably higher amounts of products 
when propagated as anchorage-dependent cultures as compared with the same cell line in 
suspension (Nilsson and Mosbach, 1987). For these reasons, different types of anchorage- 
25 dependent cells are used extensively in the production of different biological products. 

The current invention includes cells which are anchorage-dependent of nature. 
Anchorage-dependent cells, when grown in suspension, will attach to each other and grow in 
clumps, eventually suffocating cells in the inner core of each clump as they reach a size that 
30 leaves the core cells unsustainable by the culture conditions. Therefore, an efficient means of 
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large-scale culture of anchorage-dependent cells is also provided in order to effectively take 
advantage of the cells' capacity to secrete heterologous proteins. 

2. Reactors and processes for suspension. 

5 Large scale suspension culture of mammalian cultures in stirred tanks is contemplated. 

The instrumentation and controls for bioreactors have been adapted, along with the design of the 
fermentors, from related microbial applications. However, acknowledging the increased demand 
for contamination control in the slower growing mammalian cultures, improved aseptic designs 
have been implemented, improving dependability of these reactors. Instrumentation and controls 

10 include agitation, temperature, dissolved oxygen, and pH controls. More advanced probes and 
autoanalyzers for on-line and off-line measurements of turbidity (a function of particles present), 

^ capacitance (a function of viable cells present), glucose/lactate, carbonate/bicarbonate and carbon 
dioxide are also available. Maximum cell densities obtainable in suspension cultures are 

fa relatively low at about 2-4 x 10 6 cells/ml of medium (which is less than 1 mg dry cell weight per 

ljl ml), well below the numbers achieved in microbial fermentation. 

s Two suspension culture reactor designs are most widely used in the industry due to their 

nj simplicity and robustness of operation - the stirred reactor and the airlift reactor. The stirred 
reactor design has successfully been used on a scale of 8000 liter capacity for the production of 

IS interferon (Phillips etal, 1985; Mizrahi, 1983). Cells are grown in a stainless steel tank with a 
height-to-diameter ratio of 1:1 to 3:1. The culture is usually mixed with one or more agitators, 
based on bladed disks or marine propeller patterns. Agitator systems offering less shear forces 
than blades have been described. Agitation may be driven either directly or indirectly by 
magnetically coupled drives. Indirect drives reduce the risk of microbial contamination through 

25 seals on stirrer shafts. 

The airlift reactor, also initially described for microbial fermentation and later adapted for 
mammalian culture, relies on a gas stream to both mix and oxygenate the culture. The gas 
stream enters a riser section of the reactor and drives circulation. Gas disengages at the culture 
30 surface, causing denser liquid free of gas bubbles to travel downward in the downcomer section 
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of the reactor. The main advantage of this design is the simplicity and lack of need for 
mechanical mixing. Typically, the height-to-diameter ratio is 10:1. The airlift reactor scales up 
relatively readily, has good mass transfer of gasses and generates relatively low shear forces. 

5 Most large-scale suspension cultures are operated as batch or fed-batch processes because 

they are the most straightforward to operate and scale up. However, continuous processes based 
on chemostat or perfusion principles are available. 

A batch process is a closed system in which a typical growth profile is seen. A lag phase 
10 is followed by exponential, stationary and decline phases. In such a system, the environment is 
continuously changing as nutrients are depleted and metabolites accumulate. This makes 
^ analysis of factors influencing cell growth and productivity, and hence optimization of the 
yJ process, a complex task. Productivity of a batch process may be increased by controlled feeding 
fi j of key nutrients to prolong the growth cycle. Such a fed-batch process is still a closed system 
%S because cells, products and waste products are not removed. 

s In what is still a closed system, perfusion of fresh medium through the culture can be 

^1 achieved by retaining the cells with a fine mesh spin filter and spinning to prevent clogging. 
Spin filter cultures can produce cell densities of approximately 5 x 10 7 cells/ml. A true open 

© system and the most basic perfusion process is the chemostat in which there is an inflow of 
medium and an outflow of cells and products. Culture medium is fed to the reactor at a 
predetermined and constant rate which maintains the dilution rate of the culture at a value less 
than the maximum specific growth rate of the cells (to prevent washout of the cell mass from the 
reactor). Culture fluid containing cells, cell products and byproducts is removed at the same rate. 

25 These perfused systems are not in commercial use for production from mammalian cell culture. 

3. Non-perfused attachment systems. 

Traditionally, anchorage-dependent cell cultures are propagated on the bottom of small 
glass or plastic vessels. The restricted surface-to-volume ratio offered by classical and traditional 
30 techniques, suitable for the laboratory scale, has created a bottleneck in the production of cells 
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and cell products on a large scale. To provide systems that offer large accessible surfaces for cell 
growth in small culture volume, a number of techniques have been proposed: the roller bottle 
system, the stack plates propagator, the spiral film bottles, the hollow fiber system, the packed 
bed, the plate exchanger system, and the membrane tubing reel. Since these systems are non- 
5 homogeneous in their nature, and are sometimes based on multiple processes, they can 
sometimes have limited potential for scale-up, difficulties in taking cell samples, limited 
potential for measuring and controlling the system and difficulty in maintaining homogeneous 
environmental conditions throughout the culture. 

10 A commonly used process of these systems is the roller bottle. Being little more than a 

large, differently shaped T-flask, simplicity of the system makes it very dependable and, hence, 
attractive. Fully automated robots are available that can handle thousands of roller bottles per 
yB day, thus eliminating the risk of contamination and inconsistency associated with the otherwise 
: fi required intense human handling. With frequent media changes, roller bottle cultures can 

If achieve cell densities of close to 0.5 x 10^ cells/cm 2 (corresponding to 10 9 cells/bottle or 10 7 
^1 cells/ml of culture media) . 

flj 4, Cultures on microcarriers 

Van Wezel (1967) developed the concept of the microcarrier culturing systems. In this 
|l system, cells are propagated on the surface of small solid particles suspended in the growth 
medium by slow agitation. Cells attach to the microcarriers and grow gradually to confluency of 
the microcarrier surface. In fact, this large scale culture system upgrades the attachment 
dependent culture from a single disc process to a unit process in which both monolayer and 
suspension culture have been brought together. Thus, combining the necessary surface for the 
25 cells to grow with the advantages of the homogeneous suspension culture increases production. 

The advantages of microcarrier cultures over most other anchorage-dependent, large-scale 
cultivation methods are several fold. First, microcarrier cultures offer a high surface-to-volume 
ratio (variable by changing the carrier concentration) which leads to high cell density yields and a 
30 potential for obtaining highly concentrated cell products. Cell yields are up to 1-2 x 10 7 cells/ml 
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when cultures are propagated in a perfused reactor mode. Second, cells can be propagated in one 
unit process vessels instead of using many small low-productivity vessels (i.e., flasks or dishes). 
This results in far better utilization and a considerable saving of culture medium. Moreover, 
propagation in a single reactor leads to reduction in need for facility space and in the number of 
5 handling steps required per cell, thus reducing labor cost and risk of contamination. 

Third, the well-mixed and homogeneous microcarrier suspension culture makes it 
possible to monitor and control environmental conditions (e.g., pH, p0 2 , and concentration of 
medium components), thus leading to more reproducible cell propagation and product recovery. 
10 Fourth, it is possible to take a representative sample for microscopic observation, chemical 
testing, or enumeration. Fifth, since microcarriers settle out of suspension easily, use of a fed- 
„ batch process or harvesting of cells can be done relatively easily. Sixth, the mode of the 
'42 anchorage-dependent culture propagation on the microcarriers makes it possible to use this 
flj system for other cellular manipulations, such as cell transfer without the use of proteolytic 
f J enzymes, cocultivation of cells, transplantation into animals, and perfusion of the culture using 
decanters, columns, fluidized beds, or hollow fibers for microcarrier retainment. Seventh, 
~ microcarrier cultures are relatively easily scaled up using conventional equipment used for 
l[l cultivation of microbial and animal cells in suspension. 

W 5. Microencapsulation of mammalian cells 

One method which has shown to be particularly useful for culturing mammalian cells is 
microencapsulation. The mammalian cells are retained inside a semipermeable hydrogel 
membrane. A porous membrane is formed around the cells permitting the exchange of nutrients, 
gases, and metabolic products with the bulk medium surrounding the capsule. Several methods 

25 have been developed that are gentle, rapid and non-toxic and where the resulting membrane is 
sufficiently porous and strong to sustain the growing cell mass throughout the term of the 
culture. These methods are all based on soluble alginate gelled by droplet contact with a 
calcium-containing solution. Lim (U.S. Patent 4,321,883) describes cells concentrated in an 
approximately 1% solution of sodium alginate which are forced through a small orifice, forming 

30 droplets, and breaking free into an approximately 1% calcium chloride solution. The droplets are 
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then cast in a layer of poly amino acid that ionically bonds to the surface alginate. Finally the 
alginate is reliquefied by treating the droplet in a chelating agent to remove the calcium ions. 
Other methods use cells in a calcium solution to be dropped into a alginate solution, thus creating 
a hollow alginate sphere. A similar approach involves cells in a chitosan solution dropped into 
5 alginate, also creating hollow spheres. 

Microencapsulated cells are easily propagated in stirred tank reactors and, with beads 
sizes in the range of 150-1500 mm in diameter, are easily retained in a perfused reactor using a 
fine-meshed screen. The ratio of capsule volume to total media volume can kept from as dense as 
10 1:2 to 1:10. With intracapsular cell densities of up to 10 , the effective cell density in the culture 
is 1-5 x 10 7 . 

The advantages of microencapsulation over other processes include the protection from 
jn j the deleterious effects of shear stresses which occur from sparging and agitation, the ability to 
IjS easily retain beads for the purpose of using perfused systems, scale up is relatively 
straightforward and the ability to use the beads for implantation. 

P 6. Perfused attachment systems 

^ Perfusion refers to continuous flow at a steady rate, through or over a population of cells 

10 (of a physiological nutrient solution). It implies the retention of the cells within the culture unit 
as opposed to continuous-flow culture which washes the cells out with the withdrawn media 
(e.g., chemostat). The idea of perfusion has been known since the beginning of the century, and 
has been applied to keep small pieces of tissue viable for extended microscopic observation. The 
technique was initiated to mimic the cells milieu in vivo where cells are continuously supplied 

25 with blood, lymph, or other body fluids. Without perfusion, cells in culture go through 
alternating phases of being fed and starved, thus limiting full expression of their growth and 
metabolic potential. The current use of perfused culture is to grow cells at high densities (i.e., 
0.1-5 x 10 cells/ml). In order to increase densities beyond 2-4 x 10 cells/ml (or 2 x 10 
cells/cm ), the medium has to be constantly replaced with a fresh supply in order to make up for 

30 nutritional deficiencies and to remove toxic products. Perfusion allows for a far better control of 
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the culture environment (pH, p0 2 , nutrient levels, etc.) and is a means of significantly increasing 
the utilization of the surface area within a culture for cell attachment. 

Microcarrier and microencapsulated cultures are readily adapted to perfused reactors but, 
5 as noted above, these culture methods lack the capacity to meet the demand for cell densities 
above 10 cells/ml Such densities will provide for the advantage of high product titer in the 
medium (facilitating downstream processing), a smaller culture system (lowering facility needs), 
and a better medium utilization (yielding savings in serum and other expensive additives). 
Supporting cells at high density requires efficient perfusion techniques to prevent the 
1 0 development of non-homogeneity. 

The cells of the present invention may, irrespective of the culture method chosen, be used 
m in protein production and as cells for in vitro cellular assays and screens as part of drug 
s ] development protocols. 

fl 

U! J. Kits 

s All the essential materials and reagents required for the various aspects of the present 

invention may be assembled together in a kit. When the components of the kit are provided in 
^ one or more liquid solutions, the liquid solution preferably is an aqueous solution, with a sterile 
3§ aqueous solution being particularly preferred. 

For in vivo use, the instant compositions may be formulated into a single or separate 
pharmaceutically acceptable syringeable composition. In this case, the container means may 
itself be an inhalant, syringe, pipette, eye dropper, or other such like apparatus, from which the 
25 formulation may be applied to an infected area of the body, such as the lungs, injected into an 
animal, or even applied to and mixed with the other components of the kit. 

The components of the kit may also be provided in dried or lyophilized forms. When 
reagents or components are provided as a dried form, reconstitution generally is by the addition 
30 of a suitable solvent. It is envisioned that the solvent also may be provided in another container 
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means. The kits of the invention may also include an instruction sheet defining administration of 
the gene therapy and/or the chemotherapeutic drag. 

The kits of the present invention also will typically include a means for containing the 
vials in close confinement for commercial sale such as, e.g., injection or blow-molded plastic 
containers into which the desired vials are retained. Irrespective of the number or type of 
containers, the kits of the invention also may comprise, or be packaged with, an instrument for 
assisting with the injection/administration or placement of the ultimate complex composition 
within the body of an animal. Such an instrument may be an inhalant, syringe, pipette, forceps, 
measured spoon, eye dropper or any such medically approved delivery vehicle. Additionally, 
instructions for use of the kit components is typically included. 

The following examples are included to demonstrate preferred embodiments of the 
invention. It should be appreciated by those of skill in the art that the techniques disclosed in the 
examples which follow represent techniques discovered by the inventor to function well in the 
practice of the invention, and thus can be considered to constitute preferred modes for its 
practice. However, those of skill in the art should, in light of the present disclosure, appreciate 
that many changes can be made in the specific embodiments which are disclosed and still obtain 
a like or similar result without departing from the spirit and scope of the invention. 

EXAMPLE 1 
Modification of the RB Protein 
A, Construction of RB cDN As Expressing N-terminal Truncated pRB Proteins 

For construction of modified RB cDNAs with various N-terminal deletions, a series of 
PCR™ primers were designed and synthesized according to the sequences of RB cDNA. The 
sense primers were determined by the RB cDNA sequences downstream of the deleted 
N-terminal sequence. All primers contain a Hindlll restriction site (underlined) at the 5'-end and 
the consensus Kozak cassette (GCCGCC) followed by an ATG (italics). The complete 
nucleotide sequences of the sense primers are as follows: 
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5'-CCC AAGCTT GCCGCCv4rGGAGCAGGACAGCGGCCCGGAC-3' (OMRbSd2-34; 
SEQ IDN0:14); 

5'-CCC AAGCTT GCCGCC4 rGGATTTT ACTGC ATTATGTC AG-3 ' (OMRbSd2-55 ; 
SEQ ID NO: 15); 

5 5'-CCC AAGCTT GCCGCC^rGGAGAAAGTTTCATCTTGTGAT-3' (OMRbSd2-78; 

SEQ ID NO: 16); 

5'-CCC AAGCTT GCCGCC^rGCTGTGGGGAATCTGTATCTTT-3' (OMRbSd2-97; 
SEQ ID NO: 17); 

5'-CCC AAGCTT GCCGCC4rGTCAAGACTGTTGAAGAAG-3' (OMRbSdl-147, SEQ 
10 ID NO: 18). 

The anti-sense primer 5'-GTCCAAG AGAATTC ATAAAAGG-3 ' (OMRbAS300; SEQ 
ID NO: 13) overlaps with the EcoW site (underlined) at the nucleotide +900 of the RB cDNA 
yl (the A of the first in-frame ATG is designated as position +1). The anti-sense primer was paired 
SI with each sense primer described above to amplify various modified 5'-RB cDNA fragments 
1=1 using plasmid F7 as template (which contains the full-length RB cDNA). 

s After amplification by PCR™ with each pair of primers, the DNA fragments were 

si digested with Hindlll and EcoRL and subcloned into plasmid pCMVRB 110 which had been cut 
'r° with the same enzymes. The resultant expression plasmids carrying the modified RB cDNAs 
2j| with N-terminal deletions corresponding to amino acids 2-34 (SEQ ID NO:28 (nucleic acid 
fe sequence) and SEQ ID NO:29 (amino acid sequence)), 2-55 (SEQ ID NO:30 (nucleic acid 
sequence) and SEQ ID NO:31 (amino acid sequence)), 2-78 (SEQ ID NO:32 (nucleic acid 
sequence) and SEQ ID NO:33 (amino acid sequence)), 2-97 (SEQ ID NO:34 (nucleic acid 
sequence) and SEQ ID NO:35 (amino acid sequence)) and 1-147 (SEQ ID NO:36 (nucleic acid 
25 sequence) and SEQ ID NO:37 (amino acid sequence)) were named as pCMVRBd 2 .3 4 (a deletion 
of amino acids 2 to 34 of the wild type RB protein), pCMVRBd2_55 (a deletion of amino acids 2 
to 55 of the wild type RB protein), pCMVRBd 2 _ 78 (a deletion of amino acids 2 to 78 of the wild 
type RB protein), pCMVRBd 2 . 97 (a deletion of amino acids 2 to 97 of the wild type RB protein) 
and pCMVRBd M47 (a deletion of amino acids 1 to 147 of the wild type RB protein; amino acid 
30 148 is a methionine) respectively. 
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B. Construction of RB cDNAs with Internal Deletions or Mutations 

A total of seven pRB expression plasmids carrying RB cDNAs with varying internal 
deletions or mutations have been constructed, namely pCMVRBd 31 . 107 (a deletion of amino acids 
5 31 to 107 of the wild type RB protein), pCMVRBd 77 . 107 (a deletion of amino acids 77 to 107 of 
the wild type RB protein), pCMVRBm ill/112 (a mutation of amino acid 1 1 1 of the wild type RB 
protein from aspartic acid to glycine and a mutation of amino acid 112 from glutamic acid to 
aspartic acid), pCMVRBd 11M81 (a deletion of amino acids 111 to 181 of the wild type RB 
protein), pCMVRBd m _ 2 4i (a deletion of amino acids 111 to 241 of the wild type RB protein), 
10 pCMVRBdi 

81-241 ( a deletion of amino acids 181 to 241 of the wild type RB protein) and 
pCMVRBd 24 2-3oo ( a deletion of amino acids 242 to 300 of the wild type RB protein), 

? B For the construction of pCMVRBd 3M07? an RB cDNA fragment from nucleotide position 

ry +325 to +910 was amplified from the plasmid F7 by PCR™ using the primers 
j| 5'-GCG CCTGAGG ACCTAGATGAGATGTCGTTC-3' (SEQ ID NO: 19) and OMRbAS300 
U] (SEQ ID NO: 13). This RB cDNA fragment was digested with Bsu36l (underlined) and £coRI 
s (from OMRbAS300), and inserted into plasmid pCMVRB 110 digested with the same enzymes, to 
^ replace the original RB cDNA fragment from nucleotides +91 to +900. The nucleic acid 
^ sequence of pRBA31-107 is SEQ ID NO:38, and the corresponding amino acid sequence is SEQ 
W ID NO:39. 

For the construction of pCMVRBd 77 . 107 , an RB cDNA fragment (nucleotides +328 to 
+910) was amplified from the plasmid F7 by PCR™ using the oligonucleotides 
5'-GCGGTTAACCCTAGATGAGATGTCGTTCACT-3' (SEQ ID NO:20) and OMRbAS300 
25 (SEQ ID NO: 13), followed by digestion with Hpal (underlined) and EcoN. The amplified, 
digested fragment was inserted into plasmid pCMVRB 110 digested with the same enzymes, to 
replace the RB cDNA fragment from nucleotides +230 to +900. The nucleic acid sequence of 
pRBA77-107 is SEQ ID NO:40, and the corresponding amino acid sequence is SEQ ID NO:41. 

-79- 



For the construction of pCMVRBm 111/n 2, two pairs of primers were used to change 
nucleotide A (position +332 of the wild-type RB cDNA) to G, in order to change the codon for 
aspartic acid (GAT) to glycine (GGT), thus creating a new restriction enzyme site, Avrll, and 
nucleotide G (position +336 of the wild-type RB cDNA) to T ? in order to change the codon for 
5 glutamic acid (GAG) to aspartic acid (GAT). The first pair of primers are 
S'-CCCMGCTTGCCGTC^trGCCGCCCAAAACCCCCCGA-S' (OMRBS1; SEQ ID NO:21) 
and 5 '-CTC A C CTAGG TC A ACTGCTGC AAT-3 ' (OMRbAS332; SEQ ID NO:22; the mutated 
base is in bold). The second pair of primers are 5'-GTTGA CCTAGG TGATATGTCGTTC-3 ; 
(OMRbS332; SEQ ID NO:23; the mutated bases are in bold) and OMRbAS300 (SEQ ID 

10 NO: 13). The PCR™ products amplified with OMRBS1 and OMRbAS332 were digested with 
Hind III and Avrll (underlined), and those amplified with OMRbS332 and OMRbAS300 were 
digested with ^4vrII and EcdRI. These fragments were ligated together into plasmid pCMVRB 1 10 

y;l digested with Hindlll and EcoRl to replace the corresponding wild-type RB cDNA sequences. 

ifjj The nucleic acid sequence of pRBmlll/112 is SEQ ID NO:50> and the corresponding amino 

Q acid sequence is SEQ ID NO:5 1 . 



s For the construction of pCMVRBd m _ 181 the RB cDNA fragment (nucleotides +543 to 

SI +910) was amplified from plasmid F7 by PCR™ using the oligonucleotides 
^ 5 f -GCG CCTAGGA TCTACTGAAATAAATTCTGCA-3' (SEQ ID NO:24) and OMRbAS300 
2© (SEQ ID NO: 13), followed by digestion with ^4vrII (underlined) and EcoKL. This fragment was 
then ligated into pCMVRBm 111/112 (above) digested with the same enzymes to replace the RB 
cDNA fragment from nucleotides +331 to +900. The nucleic acid sequence of pRBAlll-181 is 
SEQ ID NO:42, and the corresponding amino acid sequence is SEQ ID NO:43. 

25 For the construction of pCMVRBd m _ 241? a 5' RB cDNA fragment containing nucleotides 

+1 to +331 was obtained by digestion of pCMVRBm m with Hindlll and Avrll. The 3' RB 
cDNA fragment beginning from nucleotide +722 was isolated from the same plasmid digested 
with Pvull and BamHl. Then the two DNA fragments (in-frame) were ligated into pCMV-G 
digested with Hindlll and BamHl. The nucleic acid sequence of pRBAl 11-241 is SEQ ID 

30 NO:44, and the corresponding amino acid sequence is SEQ ID NO:45. 
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For the construction of pCMVRBd 181 . 241? a 5'-RB cDNA fragment containing nucleotides 
+1 to +538 was amplified from plasmid F7 by PCR™ with primers OMRBS1 (SEQ ID NO:21) 
and 5 r -CCC GATATC AACTGCTGGGTTGTGTCAAATA-3' (SEQ ID NO:25) using plasmid F7 
5 as a template. The obtained RB cDNA fragment was cut with Hindlll and EcoRY (underlined), 
and inserted into pCMVRB 110 to replace the original 5' RB cDNA fragment between the Hindlll 
and Pvull sites. The nucleic acid sequence of pRBA181-241 is SEQ ID NO:46, and the 
corresponding amino acid sequence is SEQ ID NO:47. 

10 For the construction of pCMVRBd 242 _ 300 , primers OMRBS1 (SEQ ID NO:21) and 5'- 

CCCGAATTCGTTTTATATGGTTCTTTGAGCAA-3 ' (SEQ ID NO:26) were used to amplify 

,^ the 5' RB cDNA fragment containing nucleotides +1 to +722 using plasmid F7 as a template. 

yj The amplified product was digested with #mdIII and EcoRl (underlined), and inserted into 

fl pCMVRB 110 digested with the same enzymes to replace the original 5'RB cDNA sequences 
from nucleotides +1 to +900. The nucleic acid sequence of pRBA242-300 is SEQ ID NO:48, 

y J and the corresponding amino acid sequence is SEQ ID NO:49. 

.SSES. 

SI C. Characterization of N-terminal Modified RB Proteins 

^ An RB-defective bladder carcinoma cell line, 5637 was transfected with the expression 

W plasmids carrying the modified RB cDNAs driven by a CMV promoter. The biological function 
of the mutant pRBs was evaluated by a combined technique involving immunocytochemical 
staining and [ H]-thymidine in situ labeling of the tumor cells after transfection (Xu et ah, 1994a; 
1994b). 

25 Tumor cells were seeded onto coverslips in medium containing tetracycline and 

transfected with plasmids expressing pRB 94 , pRB 110 or other mutant RB proteins. At specified 
time point after removal of tetracycline from the culture medium, the cells were incubated with 1 
ml of fresh medium containing 10 jaCi [ 3 H]-methyl thymidine (Amersham, Arlington Heights, 
IL) for 2 hours at 37°C, then fixed and immunochemical^ stained for expression of RB protein 

30 as described previously (Xu et ah, 1991a; 1991b). Stained slides were subsequently coated with 
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a thin layer of gelatin and dried at 37°C overnight. The slides were then overlaid with 
autoradiographic emulsion (Type NTB2, Eastman Kodak, Rochester, NY) and exposed for 2 
days. After development, slides were examined under a light microscope. Twenty-four hours 
after transfection, cells were processed for immunocytochemical staining of RB protein and [ H]- 
5 thymidine incorporation assay as described above. 

The results are illustrated in Table 5. When up to 55 amino acid residues were deleted 
from the N-terminal of pRB, the DNA synthesis was not significantly reduced in the cells 
transfected with the mutant pRB expression plasmids compared to cells expressing the full- 
10 length RB protein. However, when another 23 amino acids were removed from the N-terminal, 
the cellular DNA synthesis was dramatically suppressed by expression of the truncated pRB. 

i Table 5 

ry % Cells Incorporating [ 3 H]-Thymidine 



RB Construct 


RB 1 


RIY 


Wild-Type 


14 


41 


d2-34 


12 


42 


d2-55 


11 


43 


d2-78 


3 


41 


d2-97 


3 


42 


dl-112(RB 94 ) 


2 


42 


dl-147 


4 


42 


d31-107 


3 


41 


d77-107 


2 


40 


dl 11-1 12 


6 


40 


dlll-181 


3 


38 


dl 11-241 


2 


40 


dlll-414 


24 


42 


dl81-241 


8 


43 


d242-300 


17 


43 
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As demonstrated in Table 5, the pRB mutants with any deletions between amino acid 55 
and 181 significantly inhibit DNA synthesis after being introduced into the tumor cells. Of note, 
cells transfected with pRBs containing deletions only between amino acid 181 and 241 showed 
weaker inhibition of DNA synthesis than those transfected with plasmids expressing pRBs 
carrying deletions between amino acid 55 and 181, although these were still more effective than 
cells transfected with the full-length pRB expression plasmid. Thus, in view of this data, 
modifications that combine certain of the above deletions, for example a deletion between amino 
acid 1 and amino acid 241, would be expected to have similar significant DNA synthesis 
inhibitory activity. 

Additionally, two pRB mutants with two deletions each, either between amino acid 2 and 
34 and between amino acids 76 and 112, or between amino acids 2 and 55 and between amino 
acids 76 and 112 significantly inhibited DNA synthesis as compared to the wild-type RB. The 
results indicated the boundary of the putative N-terminal domain probably located between 
amino acid 182 and 300, most probably between amino acid 182 and 241. In addition, a pRB 
carrying a point mutation at amino acid position 111 converting aspartic acid to glycine 
significantly suppressed DNA synthesis, further suggesting that this region is vital for regulating 
pRB function. 

EXAMPLE 2 

Modification of the CMV Promoter/Enhancer Controlling Expression of the VP16 
Transactivating Domain in the Tetracycline-Responsive Gene Expression System 

The modified retinoblastoma genes and proteins described above have a number of 
practical utilities, including, but not limited to, gene therapy. For these aspects, expression 
systems are needed. While systems such as those described above are appropriate for certain 
embodiments, they have certain shortcomings in relation to gene therapy using cytotoxic 
constructs. The original tetracycline-responsive gene expression system of Gossen and Bujard 
(1992) is an attractive system, but has certain drawbacks, such as squelching effects on cell 
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growth (Gill and Ptashne, 1988). To overcome these and other drawbacks, the inventors have 
improved the tetracycline-responsive gene expression system. 

The original tetracycline repressor/operator-based regulatory system consists of two 
5 plasmids, pUHD15-l and pUHC13-3 (U. S. Patent 5,464,758, incorporated in its entirety herein 
by reference; Gossen and Bujard 1992). pUHC13-3 is a tetracycline (Tc; tet) sensitive 
expression vector containing a hybrid minimal human CMV promoter, in which tet operator 
sequences had been inserted upstream of the TATA box. pUHD15-l contains sequences 
encoding a tetracycline responsive transactivator (tTA), with expression driven by a wild-type 
10 CMV promoter. In transient experiments using this system, the inventors found that efficiently 
reversible transgene expression was observed in many tumor cell lines studied. However, 
attempts to isolate long-term clones expressing the reporter gene in a tetracycline-responsive 
ji manner were unsuccessful This was most likely caused by the high intracellular levels of the 
:T tTA transactivator, whose expression was driven by the strong CMV promoter/enhancer 
W sequence in the plasmid pUHD15-l. The tTA transactivator contains the VP- 16 activating 
yi domain, which is known to have squelching effects on cell growth (Gill and Ptashne, 1 988). 

Therefore, to resolve this problem and to further improve the system, the tTA expression 
U cassette was first modified by replacing the strong CMVp enhancer (Boshart et al. 9 1985) in the 
2| original pUHD15-l plasmid with a pair of 19 bp imperfect direct repeat sequence (a portion of 
^ the CMVp enhancer; SEQ ID NO:5). The modification of the hCMV promoter/enhancer was 
done by removal of a portion of the 5 ? enhancer sequences from the hCMV promoter. 

Three pairs of oligonucleotide primers were designed based on the published sequence of 
25 the hCMV promoter (Boshart et al, 1985). A Xhol and an EcoBl restriction enzyme site 
(underlined) was added to the 5' end of each sense and the anti-sense oligo, respectively. The 
sense oligos are: 5 ? -CCGCTCGAGCAATGGGCGTGATAGCGG-3' (OMCMVsl; SEQ ID 
NO:6); 5 '-CCGCTCGAGCACC AAAATCAACGGGA-3 ' (OMCMVs2; SEQ ID NO:7) and 5'- 
CCGC1CGAGCAACTCCGCCCCATTGAC-3' (OMCMVs3; SEQ ID NO:8), respectively, and 
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they shared the same anti-sense primer, 5 ' -TAGAC ATAT G AATTC GCGGCC-3 ' (OMCMVas; 
SEQ IDNO:9). 

The template used in PCR™ amplification was plasmid pUHD15-L PCR™ amplification 
5 with primer pairs of OMCMVsl + OMCMVas; OMCMVs2 + OMCMVas and OMCMVs3 + 
OMCMVas, generated three shorter versions of CMV promoter with lengths of 282 bp (namely 
mhCMVpl), 203 bp (mhCMVp2) and 168 bp (mhCMVp3) respectively. The purified shortened 
CMV promoter/enhancer fragments were double digested with Xhol and EcoRl, and inserted into 
pUHD15-l to replace the original hCMV promoter. This produced three new tTA expressing 
1 0 plasmids, namely pmCMV 1 -tTA, pmCMV2-tT A and pmCMV3-tTA. 

To determine the relative strength of these promoters, the tTA in these newly constructed 
y;! plasmids, as well as plasmid pUHD15-l, was replaced by a chloramphenicol acetyltransferase 
fj] (CAT) gene from plasmid pRc/CMV-CAT (Invitrogen, San Diego, CA), thus generating four 
l| CAT expression plasmids, pmCMVl-CAT, pmCMV2-CAT, pmCMV3-CAT and pCMV-CAT. 
Ifl In these plasmids, CAT expression is driven by mhCMVpl, mhCMVp2, mhCMVp3 and the full- 
s length hCMVp, respectively. To evaluate the relative activity of the modified CMV promoters, 

the CAT expression plasmids were introduced into three cell lines, the tumor cell lines 5637 and 
^ Saos2, and the embryonal kidney cell line 293, via the Lipofectin method (Life Technologies, 
W. Gaithersburg, MD). Forty-eight hours after transfection, cell lysates were prepared and CAT 

activity was measured by a CAT FLASH assay kit from Stratagene (Stratagene, La Jolla, CA). 

As shown in FIG. 1, after enhancer sequences were partially removed, the activity of the 
promoter was dramatically reduced in all three transfected cell lines. FIG. 1 is a graphical 

25 representation of the CAT activity in the 5637 and Saos-2 cell lines. The more enhancer 
sequences that were deleted, the weaker was the promoter that remained. The order of promoter 
activity from strongest to weakest is hCMV, mhCMVpl, mhCMVp2 and mhCMVp3. The 
activity of mhCMVpl is 17.7% of the full-length hCMV promoter, while the mhCMVp3 activity 
is only 3.3% of the hCMV promoter in 5637 cells (FIG. 1). After comparing the relative 

30 promoter activity of the modified promoters, mhCMVpl (SEQ ID NO: 5) was chosen for the 
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modified tetracycline regulatable gene expression system. mhCMVpl showed optimal 
tetracycline-controlled transactivator (tTA) expression with no squelching effects on host cell 
growth (FIG. 2), an important characteristic for potential use in human gene therapy. 

EXAMPLE 3 

Construction of Single Plasmid,, Tetracvcline-Regulated Vector 

A single plasmid vector named EC1214A was constructed. This plasmid contains: 1) the 
modified tetracycline-responsive transactivator (tTA) expression cassette to eliminate the 
squelching effects of tTA on host cell growth; 2) the tTA-dependent promoter from plasmid 
pUHC13-3; 3) a generic intron sequence; 4) a multiple cloning site downstream of the promoter 

and intron; and 5) a neo^ expression cassette to allow G418 selection. Expression in this system 
is regulated by tetracycline, or a tetracycline analog. A "tetracycline analog" will be understood 
to be any one of a number of compounds that are closely related to tetracycline, and which bind 
to the tet repressor with at least an affinity (KJ of at least 10 6 /M, preferably with a K a of 10 9 /M, 
and more preferably with a K a of 10 n /M. Exemplary, but in no way limiting, of such 
tetracycline analogs are those disclosed by Hlavka and Boothe (1985), Mitschef (1978), the 
Noyee Development Corporation (1969), Evans (1968) and Dowling (1955), each of which is 
incorporated herein in its entirety. 

Plasmid pMLSIS.CAT (Choi etah, 1991) contains an generic intron sequence which 
consists of a portion of the 5 '-untranslated leader from the adeno virus-major-late region, which 
contains part of the first exon of the tripartite and the first intervening sequence, as well as a 
synthetic splice donor/acceptor sequence derived from an IgG variable region. A pair of 
oligonucleotides, 5 ' -CTA GAATTC GCTGTCTGCG-3 ? (SEQ ID NO:10) and 5'- 
GCTCTAGATGCAGTTGGACCTGGGAG-3 9 (SEQ ID NO:l 1), flanking the intron sequence in 
plasmid pMLSIS.CAT and containing an EcoRI and Xbal site, respectively (underlined), were 
synthesized. After amplification by PCR™, the intron fragment was digested with EcoRI and 
Xbal, and inserted into the corresponding enzyme sites in plasmid pUHD15-l. 
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Subsequently, a small DNA fragment containing CM, HindUI, EcoKV, EcoRI, Pstl, 
Smal and BamUl cloning sites (obtained from plasmid pBluescriptSK) was inserted into the new 
plasmid downstream of the intron to produce an expression vector containing the hCMV 
promoter, a generic intron, multiple cloning sites and a polyadenylation signal from the SV40 
5 virus. This intermediate vector was given the name of pCMV-G. The SV40 polyadenylation 
signal of pCMV-G was then replaced by a HSV thymidine kinase (TK) gene polyadenylation 
signal sequence to generate a plasmid, named pCMV*-G-TKpA. 

Plasmid pRc/CMV (Invitrogen, San Diego, CA) was double digested with restriction 
1 0 enzymes Nrul and Xbal. The 5' overhang from the Xbal digest was filled in by Klenow fragment 
of DNA polymerase (Life Technologies, Gaithersburg, MD), and the blunt-ended insert was 
ligated to a DNA fragment containing mhCMVl-tTA obtained from plasmid pmCMVl-tTA 
y j (Example 2). The new plasmid was named pmCMV 1 -tTA.neo. 

$3 Finally, a DNA fragment containing the tTA-dependent promoter, the generic intron and 

Ul the TK polyadenylation signal was isolated from plasmid pCMV*-G-TKpA, and inserted into the 
3 Bglll site of plasmid pmCMVl -tTA.neo to produce a vector named EC1214A, which carries 
^ both the tTA expression cassette and the tTA-dependent promoter as well as a selection marker, 
H the neomycin resistance gene. 

ii 

" ' EXAMPLE 4 

Construction of a Single Plasmid Tetracycline Positively-Induced (Tet-on) Vector 

The original tetracycline repressor/operator-based tet-on system also consists of two 
plasmids, pUHD17-lneo (or pUHD172-lneo) and pUHC13-3 (Gossen etal, 1995). pUHC13-3 

25 is a tetracycline sensitive expression vector containing a hybrid minimal human CMV promoter, 
in which tet operator sequences had been inserted upstream of the TATA box. pUHD17-lneo or 
pUHD172-lneo contains sequences encoding a reverse tetracycline responsive transactivator 
(rtTA), with expression driven by a wild-type CMV promoter. In transient experiments using 
this system, it was found that efficiently reversible transgene expression was observed in many 

30 tumor cell lines studied. As opposed to the original tetracycline system, expression is turned on 
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in the presence of tetracycline or a tetracycline analog, such as doxycycline, while expression is 
turned off in the absence of tetracycline. However, the rtTA transactivator contains the VP- 16 
activating domain, which is known to have squelching effects on cell growth (Gill and Ptashne, 
1988). 

Therefore, to resolve this problem and to further improve the system, the rtTA expression 
cassette was first modified by replacing the strong CMVp enhancer (Boshart et al. 9 1985) in the 
pUHD17-lneo or pUHD172-lneo plasmid with a pair of 19 bp imperfect direct repeat sequence 
(SEQ ID NO:5). The modification of the hCMV promoter/enhancer was done by removal of a 
portion of the 5' enhancer sequences from the hCMV promoter (Example 2). The new rtTA 
expressing plasmid was named pmCMVl-rtTA. 

A single plasmid vector named EC1214B was constructed using pmCMVl-rtTA. This 
plasmid contains: 1) the modified reverse tetracycline-responsive transactivator (rtTA) 
expression cassette to eliminate the squelching effects of rtTA on host cell growth; 2) the 
rtTA-dependent promoter from plasmid pUHC13-3; 3) a generic intron sequence; 4) a multiple 

cloning site downstream of the promoter and intron; and 5) a neo^ expression cassette to allow 
G41 8 selection. The construction was performed as outlined in Example 3. 

EXAMPLE 5 

Construction of Retinoblastoma (RB) and p53 Tetracvcline-Controlled Vectors 
A. Construction of Inducible pRB 110 Expression Vector 

To construct an inducible pRB 110 expression plasmid, plasmid F7 (Takahashi et al 3 1991) 
or p4.95BT (Friend et a/., 1987), containing the full-length RB 110 gene cDNA, was digested with 
the restriction enzymes Acyl at nucleotide -322 and Seal at +3230 (the A of the second in-frame 
ATG start codon was designated nucleotide +19). The 5' overhangs generated by the Acyl digest 
were treated with E. coli DNA polymerase I in the presence of all four dNTPs to generate blunt 
ends. BamUl linkers were ligated onto the fragment, and the fragment was then digested with 
BamBI to remove excess linkers and generate BamRl ends (Maniatis etal 9 1989; Ausubel etaL, 
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1992). The resultant RB cDNA fragment of 3552 bp was inserted into the unique BamHl site of 
EC1214A to generate pCMV*-tTA-RB 110 . 

B. Construction of Inducible pRB 94 Expression Vector 

It is known that the primary sequence surrounding the AUG codon GCC( A G )CCAUGG 
(SEQ ID NO:27) is the optimal context for initiation of translation in higher eukaryotes (Kozak 
1991). A surprising realization is that, although nearly all vertebrate mRNAs have features that 
ensure the fidelity of initiation, many mRNAs that encode critical regulatory proteins do not 
appear to be designed for efficient translation (Kozak 1991). In reviewing the RB cDNA 
sequence, it was found that the AUG start codon for both the full length pRB 110 and the N- 

94 

terminal truncated pRB are in a suboptimal context for initiation of translation in higher 
eukaryotes. For example, there is an out-of-frame AUG codon at the nucleotide -5 position (the 

94 

A of the ATG start codon for the pRB cDNA is designated nucleotide +1), and the leading 

94 

sequence of the ATG codon for pRB is suboptimal as compared to the consensus initiator 

94 

context shown above. To improve the translation efficiency of the pRB cDNA, site-directed 
mutagenesis was used to optimize the DNA sequence upstream of the second internal in-frame 
ATG codon of RB^ 4 for optimal translational initiation. 

The modified 5'-RB 94 cDNA fragment was obtained by PCR™ using plasmid F7 carrying 
the full-length RB 110 cDNA as the template. The sense primer used for the PCR™ reaction (5'- 
CCCMGCTTGCCGCCATGTCGTTCACTTTTAC-3'; SEQ ID NO:12) contained a HindUI 
restriction site (underlined) and a Kozak cassette (italics; Kozak, 1987). The antisense primer 5'- 
GTCC AAG AGAATTC ATAAAAGG-3 ' (OMRbAS300; SEQ ID NO: 13) overlapped with the 
£coRI site (underlined) at nucleotide +900 of the RB cDNA (the A of the first in-frame ATG is 
designated as position +1). The PCR™ product was digested with HindXR and EcoRL, then 
ligated with a DNA fragment containing the 3'-RB cDNA fragment between £coRI (position 
+900) and BamRl (+3548) isolated from plasmid F7. The entire RB 94 cDNA fragment was 
inserted into the flwdlll and BamRl sites of EC1214A to produce the inducible pRB 94 
expression plasmid, pCMV*-tTA-RB 94 . 
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C. Construction of Inducible p53 Expression Vector 

A plasmid, pC53-SN3 (Baker etal, 1990), containing the full length p53 gene cDNA 
was digested with BamHl, and the fragment containing the full length p53 gene was inserted into 
5 the unique Bamlil site of EC1214A to generate pCMV*-tTA-p53. 

EXAMPLE 6 
Preparation of Long-Term Tumor Cell Clones with 
Tetracvcline-Regulated pRBHO, pRB94 or p53 Expression 

10 The modified, single-plasmid tetracycline-responsive mammalian gene expression system 

has been used to obtain various stable tumor cell lines in which expression of the wild-type or the 
N-terminal truncated retinoblastoma (RB) tumor suppressor gene, or the p53 tumor suppressor 

yi gene can be reversibly turned on and off without detectable leakage. 

: hs sir 

W A. Cell Culture 

10 A breast carcinoma cell line, MDA-468 (HTB132) was obtained from ATCC and 

; w cultured in Leibovitz's L-15 (Life Technologies, Gaithersburg, MD) with 10% FBS (Life 
Technologies, Gaithersburg, MD). An osteosarcoma cell line, Saos2 was cultured in medium 
M McCoy's 5 A (Life Technologies, Gaithersburg, MD) with 15% FBS (Zhou et al 9 1994b). A 
H bladder carcinoma cell line, 5637 (HTB9) obtained from ATCC was cultured with RPMI 1640 
w " medium (Life Technologies, Gaithersburg, MD) containing 10% FBS. All cell culture media 
were supplemented with 0.5% penicillin/streptomycin. Saos2 and 5367 cells were incubated at 
37°C in a 5% C0 2 incubator, while MDA-468 cells were cultured at 37°C without C0 2 . 

25 B. Stable Transfection 

Tumor cells were transfected with the pRB 110 and pRB 94 expression plasmids, pCMV*- 
tTA-RB 110 and pCMV*-tTA-RB 94 via the Lipofectin method according to the manufacturer's 
instruction manual (Life Technologies, Gaithersburg, MD). During transfection and the 
subsequent procedures except where specified, 0.5 |ag/ml of tetracycline (Sigma, St. Louis, MO) 
30 was added to the transfection and culture media. Forty-eight hours after transfection, G418 (Life 
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Technologies, Gaithersburg, MD.) was added to the culture media at a concentration of 300 \ig/l. 
Two to three weeks later, single colonies were isolated by cloning rings. A duplicate culture was 
made for each isolated colony. While the original clone was kept in media containing 0.5 (ig/ml 
tetracycline, the duplicate clone was cultured in the absence of tetracycline. The latter was 
immunochemical^ stained with a specific anti-RB antibody, RB-WL-1 (Xu etal., 1989a). The 
matched RB-positive clones were subsequently maintained in medium containing tetracycline 
and G418 and expended for further analyses. 

C. Transient Transfection 

Tumor cells were seeded into 60-mm culture dishes or onto sterile coverslips at 
concentrations that would reach about 40% confluent next day. Twenty hours later, proper 
amount of plasmid DNA was mixed with Lipofectin reagent in Opti-MEM medium according to 
the manufacture's instruction manual (Life Technologies, Gaithersburg, MD). Cells were 
overlaid with the DNA-Lipofectin complex and incubated in a C0 2 incubator at 37°C overnight. 
Next day, fresh medium was added to replace the DNA-Lipofectin. Twenty-four or forty-eight 
hours later, cells were fixed for immunochemical staining or lysed for preparation of cell lysates. 

D. Immunocytochemical Staining of RB Protein 

Immunocytochemical staining was performed as described previously (Xu et ai, 1989a). 
For detection of RB expression, cells grown on coverslips were fixed in 45% (vol/vol) 
acetone/10% (wt / vol) formaldehyde/0.1 M phosphate buffer for 5 min. After being washed six 
times with phosphate-buffered saline, cells were blocked with 1% non-fat milk/1. 5% goat serum 
or horse serum in phosphate buffer for 4 hours at room temperature. The RB-WL-1 anti-RB 
antibody or Canji's monoclonal anti-RB antibody (QED, San Diego, CA) was diluted to 2 jag/ml 
or 0.5 (ag/ml respectively in the same solution plus 0.02% Triton X-100, and was incubated with 
the cell overnight. After being washed, the coverslips were processed for immunostaining with 
the avidin biotinylated peroxidase complex (ABC) method according to the technical manual 
(Vector Laboratories, Burlingame, CA). 
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E. Immunoblotting for pRB 

Cell lysate was prepared as previously described (Xu etaL, 1991a; 1991b). Briefly, 
cultured cells in 60 mm dishes were lysed with 0.6 ml of ice-cold lysis buffer containing 100 
mM NaCl, 0.2% NP-40, 0.2% sodium deoxycholate, 0.1% SDS and 50 mM Tris-HCl (pH8.0) 
with 50 |ig/ml aprotinin and 1 mM PMSF. The cell lysate was passed through 21 gauge needle 
several times and clarified by centrifugation. 

Direct Western immunoblotting was done as described previously (Xu etaL, 1991a; 
1991b). Sixty micrograms of total cellular protein as determined by the Bradford protein assay 
(BioRad, Richmond, CA) was electrophoresed in an 8% SDS/polyacrylamide gel and 
electroblotted to Immobilon polyvinylidene difluoride membranes (PVDF) (Millipore, Bedford, 
MA). After being blocked with 4% bovine serum albumin/1% normal goat serum in Tris- 
buffered saline, membranes were incubated overnight with RB-WL-1 antibody at a final 
concentration of 0.4 fag/ml for RB detection. The blots were then probed by the ProtoBlot 
Western blot alkaline phosphatase system (Promega, Madison, WI). 

F. Growth Curve Measurement 

A crystal violet staining method was used to measure the cell growth changes in the 
presence or absence of tetracycline (Gillies et ah, 1986). Briefly, cells were seeded into 24-well 
plates in duplicate. In one set of the plates, cells were grown in medium containing 0.5 p,g/ml 
tetracycline, while in duplicate plates, the same cells were cultured in non-tetracycline media. At 
each time point, cells were fixed with 1% glutaraldehyde in PBS and stained using 0.5% of 
crystal violet. After cells at all desired time points were collected, the crystal violet dye was 
extracted from the stained cells by incubating cells with Sorenson's solution containing 0.9% 
trisodium citrate, 0.02 N chloric acid and 45% ethanol (vol/vol). The extracted dyes were diluted 
properly with the Sorenson's solution and optical absorbencies at X 550 were measured. Growth 
curves were obtained by plotting the OD 550 against the time. 
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G. Soft Agar Assay 

For soft agar assay, appropriate number of cells were mixed with 0.3% of agarose in 
complete medium containing 15% FBS and overlaid onto 0.7% base agar in a 35 mm tissue 
culture dish. Duplicate dishes were prepared for each individual cell clones. Cells in one dish 
were cultured in the medium containing 0.5 |ng/ml of tetracycline and the other cultured in non- 
tetracycline medium. The medium was replenished every 3 days, and colonies (>50 cells) were 
counted after 3 weeks. Results were calculated as the average of three dishes per cell clone. 

H. Tumorigenicity Test in Nude Mice 

The tumorigenicity test has been described previously (Takahashi et al 9 1991). Two 
groups of athymus nude mice were set up for each cell clone to be tested. One group of mice 
were given regular water, while the other group was given water containing 5 mg/ml of 
tetracycline. A total of 5 x 10 6 cells from each RB 110 - or RB 94 -reconstituted clone were injected 
subcutaneously in 0.2 ml of phosphate buffered saline into the right flank of nude mice. RB- 
negative parental controls including Saos2, 5637 and MDA-468 cells were injected at the 
identical concentration into the left flank of the same mice. Tumors were scored 4 weeks after 
injection. 

I. Time Course Study of [ H] -Thymidine Incorporation 

Cells from inducible RB -reconstituted clones were grown on sterile coverslips in medium 
containing tetracycline. At specified time point after removal of tetracycline from the culture 
medium, the cells were incubated with 1 ml of fresh medium containing 10 |LiCi [ 3 H]-methyl 
thymidine (Amersham, Arlington Heights, IL) for 2 hours at 37°C ? then fixed and 
immunochemical^ stained for expression of RB protein as described previously (Xu et ah, 
1991a; 1991b). Stained slides were subsequently coated with a thin layer of gelatin and dried at 
37°C overnight. The slides were then overlaid with autoradiographic emulsion (Type NTB2, 
Eastman Kodak, Rochester, NY) and exposed for 2 days. After development, slides were 
examined under a light microscope. 
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J. [ 3 H] -Thymidine Incorporation of Transiently Transfected Cell Cultures 

Tumor cells were seeded onto coverslips and transfected with plasmids expressing pRB 94 , 
pRB 110 or other mutant RB proteins. Twenty-four hours after transfection, cells were processed 
for immunocytochemical staining of RB protein and [ H]-thymidine incorporation assay as 
5 described in Xu et al (1991b; 1991c). 

K. Characterization of Long-Term Inducible RB Expression Clones 

The cell growth suppression and morphological changes after RB replacement that have 

been reported in the literature are inconsistent. Studies done by the inventors and others 
10 indicated that replacement of the normal RB gene into i?5-defective tumor cells could suppress 

their tumorigenic activity in nude mice (Goodrich and Lee 1993, Bookstein et al, 1990a; 1990b; 

Chen et al, 1992; Goodrich et al, 1992b; Huang et al, 1988 ; Kratzke et al, 1993; Madreperla 
J et al, 1991; Muncaster et al, 1992; Ookawa et al, 1993; Sumegi et al, 1990; Takahashi et al, 
t] 1991; Wang et al, 1993; Xu et al, 1996; Xu et al, 1991c; Zhou et al, 1994b; Xu, 1996; Xu, 
IS 1995; Li et al, 1996; Xu et al, 1994b). The tumor cell lines studied were derived from widely 
Iff disparate types of human cancers such as the retinoblastoma, osteosarcoma, carcinomas of the 

bladder, prostate, breast and lung (Goodrich and Lee, 1993; Xu, 1996; Xu, 1995 for review). 
^ Although it has been well documented that correction of the RB gene defect alone in tumor cells 
Isfe carrying multiple genetic alterations was sufficient to revert their malignant phenotype, it was 
|| more puzzling than it appeared at first sight (Klein, 1990). 

As was shown in several early studies, after transient transfection with pRB-expressing 
plasmids, some types of the i?5-defective tumor cells in culture displayed striking changes, 
including cell enlargement, senescent-like phenotype and growth cessation (Templeton et al, 

25 1991; Qin et al, 1992). Subsequently, it was found that, however, long-term stable clones of the 
^-reconstituted tumor cells can be isolated that grew just as rapidly as the parental lines. 
Therefore, there has been a tendency in the literature to separate the inhibition of cell growth by 
RB replacement in 7?i?-defective tumor cells from its tumor suppression function (Chen et al, 
1992; Goodrich etaL, 1992b; Takahashi etal, 1991; Xu et al, 1991b; Zhou et al, 1994b; Li et 

30 al, 1996). 
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Three RB-defective tumor cell lines were used to establish long-term inducible RB 
expression clones. They were the osteosarcoma cell line, Saos2, the bladder cancer cell line, 
5637 and the breast cancer cell line, MDA-468. The rationale for choosing Saos2, 5637 and 
MDA-468 as recipient cells was that they are the RB-defective tumor cells most in use for RB- 
replacement studies. The tumor cells were transfected with the inducible RB 110 expression 
plasmid, pCMV*-tTA-RB 110 and the pRB 94 expression plasmid, pCMV*-tTA-RB 94 in the 
presence of tetracycline. After selection in 400 ^ig/ml of G418 for approximately 2 to 4 weeks, 
well separated single colonies were isolated and maintained in tetracycline containing media. A 
small portion of the isolated clones were cultured separately in the absence of tetracycline (Tc) 
for 24 to 48 hours and stained with an anti-RB antibody, RB-WL-1. Tight control of pRB 
protein expression in the stable clones of Tc-responsive ^-reconstituted 5637 bladder 
carcinoma and MDA-MB-468 breast carcinoma cells is seen. 

The RB-reconstituted 5637 cells grown in the presence of 0.5 jag/ml of Tc in the culture 

medium are RB" by immunocytochemical staining, while after removal of Tc, the pRB 

expression was turned on in the RB-reconstituted 5637 cells as shown by RB + 

immunocytochemical staining. The MDA-MB-468 breast carcinoma tumor cells were also RB" 
by immunocytochemical staining in the presence of 0.5 |ug/ml of Tc in culture medium, whereas 
after removal of Tc, the pRB expression was turned on in the RB-reconstituted MDA-MB-468 

breast carcinoma cells as shown by RB + immunocytochemical staining. Note that tetracycline is 
an inhibitor, rather than an inducer, in this tetracycline-responsive expression system. 

The minimal concentration of tetracycline required to shut off RB expression was also 
tested. It was found that as little as 0.1 |iig/ml of tetracycline can inhibit RB expression to non- 
detectable level by immunostaining, indicating that the tetracycline-regulated expression system 
is very sensitive to tetracycline. 
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Additionally, it was surprisingly found that, unlike the non-regulatable, long-term RB- 
reconstituted tumor cell lines previously reported, all the long-term tumor cell clones examined 
irreversibly ceased growing after pRB expression was turned on in Tc-free medium (FIG. 3A, 
FIG. 3B and FIG. 3C). It is known in the literature that the half-life of pRB in normal and tumor 
5 cells is only 4 to 6 hours (Mihara et al, 1989; Xu et al, 1994b; Xu et al 9 1989a), and as was 
illustrated in FIG. 2, using the modified tetracycline-regulatable system, expression of tTA 
transactivator per se in the presence or absence of low concentration of Tc had no effect on cell 
growth. 

10 The Saos2 and 5637 clones also failed to synthesize DNA, which were followed by 
noticeable morphological changes and finally, by cell death. The cellular morphology was 
markedly altered after pRB expression was induced in Tc-free medium, including cell 

: '3 enlargement, flattening, and lower nucleocytoplasmic ratio than cycling Gl/S cells. In the case 
zr\ of the bladder carcinoma cell line, 5637, changes in morphology and growth rate after either 
W transient or stable RB-replacement with a non-regulatable system have not been well 

11 documented in the literature (Goodrich et al, 1992b; Takahashi et al, 1991; Zhou et al, 1994b). 

In general, the phenotypes of the established Tc-regulatable RB + tumor lines in Tc-free 
J medium were quite similar to those documented previously for RB plasmid-transfected (or RB 
28 retrovirus vector-infected) tumor cell mass cultures (Huang et ai 9 1988; Templeton et al, 1991; 
Qin et al, 1992). All tumor cell clones under permissive condition for pRB expression were 
unable to form colonies in soft agar (FIG. 4A, FIG. 4B and FIG. 4C), and were non-tumorigenic 
in nude mice. 

25 To compare RB with another common tumor suppressor gene, p53, several long-term 

stable tumor cell clones with Tc-regulatable wild-type p53 expression have been established 
from the osteosarcoma cell line, Saos-2. A similar approach as described above was used to 
establish the p53 -reconstituted Saos-2 tumor cell clones. In brief, the parental Saos-2 tumor cells 
were transfected with the wild-type p53-expressing plasmid, pCMV*-tTA-p53 (Example 5) and 

30 selected in geneticin-containing media. The initial G418-resistant mass cultures were subjected 
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to at least two rounds of subcloning in order to obtain stable wild-type /?53-reconstituted clones. 
Because of complete deletion of the p53 gene, the parental Saos-2 cells have no endogenous p53. 

With this model system, it was found that induction of wild-type p53 expression in p53- 
5 reconstituted Saos-2 clones did result in growth arrest of the RB~/p53 nuU tumor cells. When the 
Tc-regulated /755-reconstituted Saos-2 clones were grown in the absence of Tc, many tumor cells 
shrank and detached. Furthermore, as measured by DNA fragmentation assay, abundant low 
molecular weight DNAs were detected only in samples extracted from ^55-reconstituted Saos-2 
tumor cells under permissive condition for p53 expression. These observations indicate that the 
10 wild-type p53-induced growth arrest of the RB~/p53 nu11 Saos-2 tumor cells was the result of 
apoptotic cell death rather than replicative senescence. 

Dimri et al recently reported a biomarker that identifies senescent human cells in culture 

|1i and in aging skin in vivo. It was show that several human senescent cells expressed a p- 

IjS galactosidase, histochemically detectable at pH 6 (Dimri et al, 1995). This marker, termed 

% senescence-associated p-galactosidase (SA-p-gal), is expressed by senescent, but not pre- 

^ senescent fibroblasts. SA-p-gal was also absent from immortal cells, but was induced by genetic 

ry manipulations that reversed immortality (Dimri et al, 1995). Of note, some cells, such as adult 

[ n melanocytes, expressed the SA-P-gal (pH 6 activity) independent of senescence or age. Thus, 

2§ SA-p-gal is not a universal marker of replicative senescence, which is not surprising. 

Nevertheless, by utilizing the instant long-term tumor cell clones with tetracycline- 
regulatable pRB or p53 expression, the SA-p-gal (pH 6 activity) provides a simple assay 
allowing the further characterization the RB-mediated tumor cell growth cessation. The majority 
25 (>99.9%) of young (early passage) human WI-38 fibroblasts are SA-p-gal negative. In contrast, 
the senescent (at population doubling level greater than 52) WI-38 cells were strongly SA-p-gal 
positive. All tetracycline-responsive tumor cell clones examined so far were SA-p-gal negative 
in the presence of tetracycline (RB~), and were SA-P-gal positive in tetracycline-free medium 
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(RET). The intensity of SA-p-gal staining of tumor cells in RET status, however, was variable 
depending on tumor cell types. 

Of note, although p53 reconstitution in Saos-2 (RB~, p53 nu11 ) tumor cells with either non- 
5 inducible (Chen et al, 1990; Li et aL, 1996) or inducible system did suppress their neoplastic 
phenotype, the p53 reconstituted Saos-2 clones with the tetracycline-regulatable promoter were 
SA-(3-gal negative in either presence or absence of tetracycline. Of great interest, when the p53- 
reconstituted Saos-2 cells were infected with recombinant adenovirus vectors expressing the 
wild-type pRB 110 in Tc-free medium, the tumor cells with both wild-type p53 and pRB 110 
10 expression displayed more intense SA-p-gal positive staining as compared to tumor cells only 
expressing pRB 110 . The results imply that the mechanisms for tumor suppression by pRB and 

% p53 were different from each other, but expression of pRB and p53 together had synergistic 
y effects on RB-mediated tumor cell senescence. 

IS In consideration of its potential therapeutic use, another important finding was the fact 

&4 * that the pRB-mediated replicative senescence (irreversible growth cessation) was tumor-specific. 
C3 The young WI-38 fibroblasts at early passage infected with recombinant adenovirus vector, 
rU; AdCMVpRBl 10 at multiplicity of infection (MOI) of 100 remained SA-p-gal negative, and they 

resumed a normal growth pattern about one week post-infection. Therefore pRB is a relatively 
JO safe reagents for anticancer gene therapy. In addition to therapy of advanced malignancies, the 

emerging RB gene therapy also may be beneficial in treating post-surgery residue tumors, 

superficial cancers, or premalignancies, as well as non-malignant, hyperproliferative disorders in 

certain circumstances (Chang et aL 9 1995; Xu et al, 1996). 

25 L. The broad biological basis of the TIB-mediated tumor suppression. 

In addition to tumor cell-specific senescence and the well-known antiproliferative effects, 
pRB may also play a role in inhibition of angiogenesis and in elicitation of immunogenicity of 
tumor cells. The inventors have shown that serum-free conditioned media (CM) collected from 
the tetracycline-responsive, ^-reconstituted osteosarcoma and non-small cell lung carcinoma 
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cell lines switched from angiogenic to anti-angiogenic after removal of Tc from the cell cultures. 
This switch corresponded with the onset of pRB expression as determined by Western blotting 
and immunohistochemistry (Dawson et aL 9 1996). The inventors have also reported that HLA 
class II induction by IFN-y in the RB-defective non-small cell lung carcinoma cell line, H2009, 
requires reconstitution of the wild-type RB gene expression (Lu et al y 1996). The class II 

proteins present peptides derived from proteolytically processed antigens to CD^ T 
lymphocytes as part of the immune response. Therefore, pRB likely has a role in mediating 
tumor immunogenicity as well. 

To determine if replacement of the retinoblastoma (RB) tumor suppressor gene could 
inhibit invasion of i?£-defective tumor cells, studies were conducted using the Boyden chamber 
assay (Li et al 9 1996). The studies were done in a diverse group of stable ^-reconstituted 
human tumor cell lines, including those derived from the osteosarcoma and carcinomas of the 
bladder, breast and lung. The expression of the exogenous wild-type RB protein in these tumor 
cell lines was driven by either a constitutively active promoter or an inducible promoter. It was 

found that significantly more tumor cells from the parental 7?5-defective cell lines and the RB" 
revertants than from the ^-reconstituted RB + cell lines penetrated through the Matrigel in the 
Boyden chamber assay (p < 0.001, two-tailed t-test). Of note, the inhibition of invasiveness of 
various 7?5-defective tumor cells by RB replacement was apparently well correlated with 
suppression of their tumorigenicity in vivo. In contrast, although either functional RB or p53 re- 
expression effectively suppressed tumor formation in nude mice of the RB7p53 nu11 osteosarcoma 
cell line, Saos-2, replacement of the wild-type p53 gene had much less impact on their 
invasiveness as compared to the RB gene. 

Normal human diploid cells senesce in vitro and in vivo after a limited number of cell 
divisions. This process known as cellular senescence is an underlying cause of aging and a 
critical barrier for development of human cancers. It has also been demonstrated that RB/p53- 
defective tumor cells reexpressing functional pRB alone via a modified tetracycline-regulated 
gene expression system were irreversibly growth-arrested at G0/G1 phase of the cell cycle. 
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These cells displayed multiple morphological changes consistent with cellular senescence and 
also expressed a senescence-associated (3-galactosidase biomarker. 



Further studies indicated that telomerase activity, which was presumably essential for an 
5 extended proliferative life-span of neoplastic cells, was repressed in the tumor cell lines after 
induction of pRB (but not p53) expression. These observations suggest that pRB plays a critical 
role in the intrinsic cellular senescence program. From a practical standpoint, findings imply that 
cytostatic gene therapy using RB (or RB and p53 together) may result in differential elimination 
of tumor cells through cellular senescence and crisis. At the same time the replicative lifespan of 
10 normal cells in vivo may not be affected. This could provide a potential basis for designing 
tumor-specific tumor suppressor gene therapy and anti-telomerase gene therapy. 

yi These findings, taken together, may intimate that the i?£-mediated tumor suppression has 

IT- a broad biological basis, which certainly makes the emerging RB tumor suppressor gene therapy 
W for human cancer even more attractive. 

J* M. Enhanced Tumor Suppression by an N-terminal Truncated pRB. 

JrJ Long-term stable clones of the ^-reconstituted tumor cells can be isolated with non- 

inducible gene expression systems, and most of these clones grow just as rapidly as the parental 

2j| lines. The inventors have also found that, although the £8-mediated tumor suppression was 
substantial and had a broad biological basis, it was often incomplete and a portion of the 

^-reconstituted tumor cells were able to survive and form RB + xenograft tumors in nude mice 
after a prolonged latency period (Takahashi et ah, 1991; Xu et ah, 1991b; Zhou et ah, 1994b; Li 
et ah, 1996). Similar observations have been reported by other investigators (Bookstein et al, 
25 1990b; Goodrich etah, 1992b; Kratzke etah, 1993; Ookawa et al, 1993; Wang et ah, 1993). 
This phenomenon is referred to by the inventors as tumor suppressor resistance (TSR; Zhou et 
ah, 1994b), which is an equivalent of multiple drug resistance (MDR) in chemotherapeutics. In 
the latter scenario, low-dose chemotherapy may risk the selection of metastatic tumor cells due to 
their often inherently higher resistance to cytotoxic agents. 
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The inventors subsequently reported that an N-terminal truncated RB protein of -94 kDa 
(pRB 94 ) exerted surprisingly more potent cell growth suppression as compared to the full-length 
pRB protein in a diversity of tumor cell lines examined, including those having a normal 
5 endogenous RB gene. Tumor cells transfected with the pRB 94 -expressing plasmids displayed 
multiple morphological changes frequently associated with cellular senescence. They failed to 
enter S phase and rapidly died (Xu et al 9 1994b; Resnitzky and Reed, 1995). 

The inventors recent studies in ectopic animal models demonstrated that treatment of 

10 established human RB" and RB + bladder xenograft cancers in nude mice by AdCMVpRB94, a 
replication-deficient adenovirus vector expressing the N-terminal truncated RB protein, resulted 
in regression of the treated tumors (Xu et aL, 1996). Of note, although both the full-length and 

€l the truncated forms of the RB protein, when over-expressed in tumor cells via adenovirus 

pj vectors, were capable of suppression of tumor growth, the pRB 94 was much more potent than the 

K full-length RB protein. The mechanism for the enhanced tumor suppression by the N-terminal 

y j truncated RB protein is not clear yet. 

O 94 

11 l To better understand the functional difference between the N-terminal truncated pRB 

jl and the full-length pRB 110 , the inventors have also established stable tumor cell lines with Tc- 

yj 94 

2fi responsive pRB expression. By time course analysis, it was found that as early as 6 hours after 

94 

removal of tetracycline from the cell culture medium, the pRB -reconstituted tumor cells 

94 

accumulated the maximum of both underphosphorylated and phosphorylated pRB , followed by 

failure of the vast majority of the tumor cells to incorporate 3 H-thymidine, an indicator of growth 
94 

cessation. The pRB protein was completely dephosphorylated within -18 to 24 hours. Most 
25 of the pRB 1 ^-reconstituted tumor cells, however, remained immuno-histochemically RB" at the 
6 or 8 hr-time points and had normal DNA synthesis (FIG. 5). The pRB 110 reached the highest 
level at the 24 hr-time point as determined by western blotting, and became mostly 
unphosphorylated from 24 to 48 hours after removal of tetracycline, in which period the pRB 1 10 - 
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reconstituted tumor cells finally ceased DNA synthesis (FIG. 5). Using the SA-p-gal biomarker 

94 

assay for human senescent cells, it was shown that the Saos-2 cells with pRB expression 

showed more intense SA-p-gal positive staining as compared to the pRJB 110 -expressing cells at 

94 110 
48 hr after removal of Tc. Since pRB has a longer half-life than pRB and tends to remain in 

5 an active, underphosphorylated form (U. S. Patent 5,496,731; Xu etal, 1994b), rapid 
accumulation of mostly the active forms (underphosphorylated form) of RB protein in the tumor 

94 

cells may account for the enhanced tumor cell growth suppression by pRB . In this regard, 
another truncated version of pRB, named pRB 56 , beginning at amino acid 379, has also been 
reported as a more potent inhibitor of cell cycle progression compared to the full-length pRB 
10 (Wills etal, 1995). 

Hi The advantages of the modified system are threefold: 1) it is suitable for establishing 

J* long-term stable cell lines with inducible gene expression because of lower constitutive 
*P expression of the tTA peptide; 2) the system is now contained within a single plasmid so that 
i]k only one round of transfection and selection is required; and 3) of importance, the single-plasmid 
tetracycline-responsive mammalian gene expression system is readily convertible to 
£3 tetracycline-controlled viral vectors (Examples 7-12 below). 

% EXAMPLE 7 

W Construction of Tetracvcline-Controlled Adenoviral Vectors 

The desired cDNA fragment of a gene of interest is first inserted into the single-plasmid 
tetracycline-regulatable plasmid vector, EC1214A (Example 3) or EC1214B (Example 4). The 
tetracycline-responsive foreign gene expression cassette and the modified tTA (or rtTA) 
expression cassette from the corresponding EC1214A or EC1214B plasmid vectors are then 
25 recovered using standard methods in the art for DNA manipulation (Maniatis et al, 1989; 
Ausubel et al 9 1992), and inserted into the shuttle plasmid, pAElsplA (Microbix Bio systems, 
Inc.). The resultant recombinant shuttle plasmids are then co-transfected with the master 
adenovirus type 5 (Ad5) plasmid, pBHGll, which contains the backbone of the adenovirus 
Ad5dl309 genome and E1/E3 deletion mutation (Microbix Biosy stems, Inc.) into 293 cells using 
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the LIPOFECTIN reagent (GIBCO/BRL Life Technologies). The co-transfection of 293 cells is 
performed in the presence (for tet-off system) or absence (for tet-on system) of 0.5 |ig/ml of 
tetracycline. 

5 Alternatively, a fragment containing a gene of interest is first inserted into the single- 

plasmid tetracycline-regulatable plasmid vector, EC1214A or EC1214B. The tetracycline- 
responsive foreign gene expression cassette and the modified tTA (or rtTA) expression cassette 
from the corresponding EC1214A or EC1214B plasmid vectors are then recovered and inserted, 
respectively, into the shuttle plasmid, pAElsplA and the master adenovirus plasmid, pBHGll. 

10 The resultant recombinant shuttle plasmids and the recombinant master adenovirus plasmid are 
co-transfected into 293 cells. 

yl Co-transfection of 293 cells with the recombinant shuttle plasmid and the recombinant 

ST master adenovirus plasmid produce infectious virions by in vivo recombination, in which the 
iS minigene cassette expressing the gene of interest and the modified tTA (or rtTA) expression 
111 cassette are replaced the AE1 region or AE1 and AE3 regions of the Ad5dl309 genome, 
s " respectively. Presence of recombinant adenoviruses in the transfected 293 cells is initially 
it identified by cytopathic effect (CPE). Cell culture supernatants are collected from the 
I s * transfected 293 cells in which CPE has occurred. Recombinant viruses are then isolated by 
ifj screening adenovirus plaques from 293 cell monolayers after infection with the virus 
supernatants, and further characterized by restriction enzyme digestion mapping, PCR™, or by 
expression of the gene of interest in virus-infected host cells in a tetracycline-regulatable manner. 
The recombinant adenoviruses containing the desired foreign gene as well as the modified tTA 
(or rtTA) expression cassettes are subjected to at least three rounds of plaque purification. 

25 

High-titer stocks of the tetracycline-controlled recombinant adenoviruses are prepared by 
methods modified from Graham and Prevec, (1991). The CsCl ultracentrifugation-purified 

13 

adenoviruses contain -10 viral particles per ml as measured by OD at 260 nm (1 OD 260 = 1 x 
10 12 viral particles per ml). The concentrated viral suspension is desalted by gel filtration 
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through Sephadex G50 to generate a final purified virus stock about 10 11 plaque-forming units 
(pfii) per ml in PBS. 

EXAMPLE 8 

5 Preparation of Tetracvcline-Responsive RB Adenovirus Vector 

A replication-deficient adenovirus vectors expressing N-terminal truncated pRB 94 
protein (U. S. Patent No. 5,496,731) has been used in in vivo animal studies of human cancer 
gene therapy (Xu et al 9 1996). Unfortunately, the ratio of viral particles to plaque-forming units 
of the AdCMVpRB94 virus supernatants increased dramatically with passage, making it difficult 

10 for large-scale preparation of high-titer stocks of the AdCMVpRB94 virus for human cancer 
gene therapy clinical trials. This was probably caused by the super cell growth suppression 

^ effects of pRB94 protein on the 293 virus-producing cell line. 

Ill The modified tetracycline-responsive mammalian gene expression system has been used 

lp in a similar manner as described above to generate a tetracycline-controlled pRB 94 -containing 
]4 adenovirus vector, AdVtTA.RB94, which is designed for delivery of high-dose pRB gene 
L therapy. The entire tetracycline regulation cassette can be inserted into the El region of the 
W adenovirus genome, or the RB 94 expression cassette can be inserted into the El region of the 
yi adenovirus genome, while the transcriptional transactivation fusion protein expression cassette is 
M inserted into the E3 region of the adenovirus genome. Over-expression of pRB 94 in tumor cells 
will cause tumor cell-specific senescence and cell death. The pRB 94 cDNA has a modified 
optimal initiator context sequence. Expression of the pRB94 protein in transduced human tumor 
cells by AdVtTA.RB94 can be reversibly turned off and on. The novel AdVtTA.RB94 
recombinant adenovirus vector can be propagated efficiently in 293 cells with increased yield 
25 and quality. 
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EXAMPLE 9 

Preparation of Tetracvcline-Responsive RB/p53 Coexpression Vector 

As described in Example 6 above, although p53 reconstitution in Saos-2 (RB", p53 nuR ) 
tumor cells with either non-inducible (Chen et al, 1990; Li et aL, 1996) or inducible system did 
suppress their neoplastic phenotype, the p53 reconstituted Saos-2 clones with the tetracycline- 
regulatable promoter were SA-p-gal negative in either presence or absence of tetracycline. 
However, when the p53-reconstituted Saos-2 cells were infected with recombinant adenovirus 
vectors expressing the wild-type pRB 110 in Tc-free medium, the tumor cells with both wild-type 
p53 and pRB 110 expression displayed more intense SA-p-gal positive staining as compared to 
tumor cells only expressing pRB 110 . The results imply that the mechanisms for tumor 
suppression by pRB and p53 were different from each other, but expression of pRB and p53 
together had synergistic effects on RB-mediated tumor cell senescence. 

Since co-expression of pRB and p53 has synergistic effects on pRB-mediated, tumor- 
specific senescence (Example 6), and it has been suggested that altered RB and p53 protein status 
could be a synergistic prognostic factor in non-small cell lung carcinomas, as well as a subset of 
other human malignancies, including transitional cell carcinomas of the bladder (Xu, 1995; Xu et 
aL, 1994a; Xu et aL, 1996), combination pRB and p53 gene therapy is also contemplated as an 
alternative strategy to surmount possible tumor suppressor resistance. 

Insertion of both the modified tetracycline-responsive transactivator (tTA) expression 
cassette and the tTA-dependent pRB 110 expression cassette into the El region of the Ad5 
genome facilitates construction of an adenovirus vector simultaneously expressing two tumor 
suppressor genes, named AdVtTA.RB110/p53. In this vector, the smaller p53 expression 
cassette is inserted into the E3 region of the 34 kb master plasmid, pBHGll, through ligation 
reaction. Since attempts to replace both RB and p53 genes in the same cell have never been 
successful (Wang et aL, 1993), the inventors reasoned that adenovirus vectors simultaneously 
expressing the two tumor suppressor genes should be built in the regulatable gene expression 
system. 

-105- 



EXAMPLE 10 

Construction of Tetracycline-Controlled Retroviral Vectors 

The kat retrovirus production system produces high titer retrovirus supernatant capable of 
5 transducing efficiently hematopoietic cell types refractory to conventional retrovirus transduction 
(Finer et al 9 1994). The kat retrovirus plasmid vector with a hybrid LTR with will be combined 
with EC1214A (Example 3) to generate a retrovirus with Tc-regulatable expression. Since some 
success using standard retroviral vectors have been reported in the literature, the Tc-controlled 
retroviral vector may work better than the Tc-controlled adenoviral vector for transduction of 
10 certain cell types, such as hematopoietic stem cells. 

EXAMPLE 11 

yi Therapeutic Administration of Modified RB Constructs 

1\ A. Treatment of Human Bladder Cancers in vivo. 

i§ The human bladder cancer represents an ideal model for practicing tumor suppressor gene 

U1 therapy of solid tumors by infusing the instant modified RB protein expression retroviral vectors 
.7 into the bladder. The original experimental model of human bladder cancer was established by 
T\ Jones and colleagues (Ahlering et al, 1987). It has been shown that human bladder tumor cells 
"? k of RT4 cell line established from a superficial papillary tumor, which usually does not 
2j| metastasize, produced tumors only locally when injected by a 22-gauge catheter into the bladder 
of female nude mice. In contrast, the EJ bladder carcinoma cells which were originally isolated 
from a more aggressive human bladder cancer produced invasive tumors in the nude mouse 
bladders which metastasized to the lung spontaneously. Therefore, this model can be used for 
treatment of experimental bladder cancer by in vivo gene transfer with retroviral vectors. 

25 

Tumor cells from RB minus human bladder carcinoma cell line, 5637 (ATCC HTB9) and 
RB + human bladder carcinoma cell line, SCaBER (ATCC HTB3) will be injected directly into 
the bladders of female athymic (nu/nu) nude mice (6 to 8 weeks of age) by a catheter as initially 
reported by Jones and colleagues (Ahlering et ah, 1987). Development and progression of the 
30 nude mouse bladder tumors will be monitored using a fiber-optical system to which a TV 
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monitor is attached. The experimental tumors will subsequently be treated with retrovirus 
vectors expressing the modified RB proteins of the present invention. 

Supernatants with high virus titers will be obtained from tissue culture media of selected 
clones expressing high level of human modified RB protein and confirmed as free of replication- 
competent virus prior to use. The retroviral vector suspension at high titers ranging from 4 x 10 4 
to greater than 1 x 10 7 colony-forming unit (cfu)/ml, and more preferably at a titer greater than 1 
x 10 6 cfu/ml will then be infused directly into the mouse bladders via a catheter to treat the 
tumors. The skilled artisan will understand that such treatments may be repeated as many times 
as necessary via a catheter inserted into the bladder. The tumor regression following transferring 
the modified RB gene will be monitored frequently via the fiber-optic system mentioned above. 

The same procedure as described above may be used for treating the human bladder 
cancer except that the retroviral vector suspension is infused into a human bladder bearing 
cancer. 

B. in vivo Studies Using an Orthotopic Lung Cancer Model 

Human large cell lung carcinoma, NCI-H460 (ATCC HTB177) cells which have normal 
pRB 110 expression will be injected into the right mainstream bronchus of athymic (nu/nu) nude 
mice (10 5 cells per mouse). Three days later the mice will be inoculated endobronchically with 
supernatant from the modified RB, or wild-type RB retrovirus producer cells daily for three 
consecutive days. Tumor formation suppression in the group of mice treated with the modified 
RB retrovirus supernatant, in contrast, to the group which is treated with wild-type RB retrovirus 
supernatant, will indicate that the modified RB-expressing retrovirus inhibits growth of RB + 
non-small cell lung carcinoma (NSCLC) cells, whereas the wild-type RB-expressing retrovirus 
does not. 

C. Treatment of Human Non-Small Cell Lung Cancers in vivo. 

Non-small cell lung cancer patients having an endobronchial tumor accessible to a 
bronchoscope, and also having a bronchial obstruction, will be initially selected for modified RB 
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gene therapy. Treatment will be administered by bronchoscopy under topical or general 
anesthesia. To begin the procedure, as much gross tumor as possible will be resected 
endoscopically. A transbronchial aspiration needle (21G) will be passed through the biopsy 
channel of the bronchoscope. The residual tumor site will then be injected with the appropriate 

5 modified RB retroviral vector supernatant, modified RB adenovirus suspension or modified RB- 
expressing plasmid vector-liposome complexes at a volume of 5 ml to 10 ml. Protamine may be 
added to a concentration of 5 ]ag/ml. The injections of therapeutic viral or plasmid supernatant 
comprising one or more of the vectors will be administered around and within the tumor or 
tumors and into the submucosa adjacent to the tumor. The injections will be repeated daily for 

10 five consecutive days and monthly thereafter. The treatment may be continued as long as there is 
no tumor progression. After one year the patients will be evaluated to determine whether it is 
appropriate to continue therapy. 

f{\ In addition, as a precaution, the patients will wear a surgical mask for 24 hours following 

l|: injection of the viral supernatant. All medical personnel will wear masks routinely during 
If! bronchoscopy and injection of the viral supernatant. Anti-tussive will be prescribed as 
J necessary. 

D. Treatment or Prevention of Human Lung Carcinomas With Liposome- 
a| Encapsulated Purified Modified RB Protein 

~ v * In yet another alternative, target tumor or cancer cells will be treated by introducing the 

instant modified RB proteins into cells in need of such treatment by any known method. For 
example, liposomes are artificial membrane vesicles that have been extensively studied for their 
usefulness as delivery vehicles of drugs, proteins and plasmid vectors both in vitro or in vivo 

25 (Mannino et ah, 1988). Proteins such as erythrocyte anion transporter (Newton etaL, 1988), 
superoxide dismutase and catalase (Tanswell et ah, 1990), and UV-DNA repair enzyme (Ceccoli 
et aL, 1989) have been encapsulated at high efficiency with liposome vesicles and delivered into 
mammalian cells in vitro or in vivo. Further, small-particle aerosols provide a method for the 
delivery of drugs for treatment of respiratory diseases. For example, it has been reported that 

30 drugs can be administered in small-particle aerosols by using liposomes as a vehicle. 
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Administered via aerosols, the drugs are deposited rather uniformly on the surface of the 
nasopharynx, the tracheobronchial tree and in the pulmonary area (Knight et ah, 1988). 

To treat or prevent lung cancers, the therapeutic modified RB proteins will be purified, 
5 for example, from recombinant baculoviras AcMNPV-modified RB infected insect cells by 
immunoaffmity chromatography or any other convenient source. The modified RB protein will 
then be mixed with liposomes and incorporated into the liposome vesicles at high efficiency. 
The encapsulated modified RB will still be active. Since the aerosol delivery method is mild and 
well-tolerated by normal volunteers and patients, the modified RB-containing liposomes can be 
10 administered to treat patients suffering from lung cancers of any stage and/or to prevent lung 
cancers in high-risk population. The modified RB protein-containing liposomes may 
administered by nasal inhalation or by a endotracheal tube via small-particle aerosols at a dose 
^ sufficient to suppress abnormal cell proliferation. Aerosolization treatments will be administered 
hi to a patient for 30 minutes, three times daily for two weeks, with repetition as needed. The 
lSj modified RB protein will thereby be delivered throughout the respiratory tract and the pulmonary 
y l area. The treatment may be continued as long as necessary. After one year, the overall condition 
7 of the patient will be evaluated to determine if continued therapy is appropriate. 

EXAMPLE 12 

2j|| Induction of Senescence and Telomerase Inhibition by Reexpression of RB 

^ Normal human diploid cells senesce in vitro and in vivo after a limited number of cell 

divisions. This process, known as cellular senescence, is an underlying cause of aging and a 
critical barrier for development of human cancers. This Example presents studies that 
demonstrate that reexpression of functional pRB alone in RB/p5 3 -defective tumor cells via a 
25 modified tetracycline-regulated gene expression system resulted in a stable growth arrest at the 
G0/G1 phase of the cell cycle, preventing tumor cells from entering S phase in response to a 
variety of mitogenic stimuli. These cells displayed multiple morphological changes consistent 
with cellular senescence and expressed a senescence-associated (3-galactosidase biomarker. 
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Additionally, telomerase activity, which is believed to be essential for an extended 
proliferative life-span of neoplastic cells, was abrogated or repressed in the tumor cell lines after 
induction of pRB (but not p53) expression. Strikingly, when returned to an non-permissive 
medium for pRB expression, the pRB-induced senescent tumor cells resumed DNA synthesis 
5 and attempted to divide. However, most cells died in the process, a phenomenon similar to 
postsenescent crisis of SV40 T-antigen-transformed human diploid fibroblasts in late passage. 
These observations provide direct evidence that overexpression of pRB alone in RB/p53- 
defective tumor cells is sufficient to reverse their immortality and cause a phenotype that is, by 
all generally accepted criteria, indistinguishable from replicative senescence. The results indicate 
1 0 that pRB may play a causal role in the intrinsic cellular senescence program. 

= ~* A. Materials and Methods 

O Establishing tumor cell lines with Tc-regulatable pRB expression 

£1 The original multiple-plasmid tetracycline repressor/operator-based regulatory system 

,^15 was improved as described in detail above. All /^-reconstituted tumor cell lines used in this 
A- Example were subjected to at least two rounds of subcloning following the initial plasmid 
£ ;) transfection and are considered pure clones. The homogeneity of these clones was verified by 
l': pRB nuclear staining. In addition, a panel assay (Zhou et aL, 1994) was used to ensure stable 
^ expression of the functional pRB under permissive conditions. The ^-reconstituted tumor cells 
oi20 were all RET in the presence of 0.5 |ig/ml of Tc in culture medium; while the great majority 
(>99%) of the cells became RB + at 24 hours after removal of Tc as shown by 
immunocytochemical staining. 

Flow cytometric analysis 

25 Single cell suspensions collected at each time point were fixed with paraformaldehyde 

and ethanol before propidium iodide (PI) (Sigma) staining. All profiles were generated using a 
FACScan flow cytometer (Becton-Dickinson). The first peak (Ml) contains cells with diploid 
DNA in G0/G1, the second peak (M3) with twice the PI -fluorescence intensity contains 
tetraploid G2/M cells, and the area between the two peaks (M2) represents the total number of 

30 cells in S phase (Nicoletti et al 9 1991). 
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SA-$-gal assay 

The assay was performed essentially as previously described (Dimri et al, 1995). 
Briefly, the cells were fixed in 2% formaldehyde/0.2% glutaraldehyde for 5 min and stained with 
5 5-bromo-4-chloro-3-indolyl p-D-galactoside (X-Gal) at pH 6.0 for 6 hours. The staining solution 
contained 1 mg/ml X-Gal, 40 mM citric acid/sodium phosphate, pH 6.0, 5 mM potassium 
ferrocyanide, 5 mM potassium ferricyanide, 150 mM NaCl and 2 mM MgCl 2 . 

Telomeric repeat amplification protocol (TRAP) assay 
10 The methodology, according to the technical manual, was modified from the original 

TRAP assay as described by Kim et al (Kim et al, 1994). In short, ~10 6 cells grown in a 
r ~ 100-mm Petri dish were harvested and resuspended in 200 |ul of ice-cold lysis buffer for 30 min 
;^ on ice, followed by centrifugation at 100,000 x g for 30 min at 4°C. The supernatant was diluted 
HI to 0.5 \ig protein/ jil, of which 2\xl was used for each TRAP assay. The telomerase reaction was 
1JC carried out at 30°C for 30 min, which was followed by a 2-step PCR™ amplification with 
J [y- 32 P]-labeled TS primer (94°C, 30 s and 60°C, 30 s for 33 cycles). The PCR™-amplified 
* telomerase extension products were subjected to electrophoresis on a 12.5% polyacrylamide gel. 

B. Results 

2|f pRB-mediated irreversible growth cessation of tumor cells 

Using the modified tetracycline (Tc)-regulatable gene expression system as described in 
detail above, dozens of long-term stable tumor cell clones were established, in which expression 
of the wild-type pRB can be reversibly turned on and off without significant leakage. The RB- 
reconstituted tumor cell clones were obtained, respectively, from the breast carcinoma cell line, 

25 MDA-MB-468, the osteosarcoma cell line Saos-2, and the bladder carcinoma cell line, 5637. 
These tumor cell lines were chosen as host cells since they were known to contain both RB and 
p53 gene mutations (Wang et al, 1993; Chen et al, 1990; Berry et al, 1996; Masuda et al, 
1987). 
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As measured by western blotting, pRB protein induced in the tumor cells reached the 
highest level about 24 hours after removal of tetracycline from the cell culture medium, and then 
became completely dephosphorylated within 24 to 40 hours. The effects of induction of pRB 
expression on tumor cell growth were subsequently examined in representative clones by 
5 measuring growth curves and ( H) thymidine incorporation (Xu et aL 9 1994b), and by flow 
cytometric analysis (Nicoletti et aL, 1991). Cell growth and DNA synthesis of all the long-term 
tumor cell clones studied ceased 24 to 48 hours after pRB expression was induced (FIG. 3A, 
FIG. 3B and FIG. 3C). The great majority of the tumor cells were arrested at G0/G1 phase of the 
cell cycle. 

10 

After a 4-day induction of pRB expression in Tc-free medium, the growth cessation of 
the tumor cells was irreversible by stimulation with a variety of mitogens, such as serum growth 
y3 factors, phytohemagglutinin (PHA) and concanavalin A (Con A). This was determined by 
lJ continuous flat growth curves as shown in FIG. 3A, FIG. 3B and FIG. 3C and failure of the 
1 S« tumor cells to incorporate ( H) thymidine in response to mitogenic stimulation. In the meantime, 
the tumor cells displayed striking morphological changes consistent with cellular senescence, 
e including cell enlargement, flattening, and lower nucleocytoplasmic ratio than cycling cells. 

p ^ Furthermore, as measured by DNA fragmentation assay, a small amount of lower 

2Q3 molecular weight DNAs were often observed in DNA samples prepared from i?5-reconstituted 
Saos-2 tumor cells grown in non-permissive but not permissive conditions for pRB expression. 
This finding suggested a low level of spontaneous apoptosis of the i?5-defective tumor cell 
culture, which was inhibited by induction of pRB expression. In addition, switching on pRB 
expression in the ^-reconstituted 5637 and MDA-MB-468 tumor cell lines also inhibited 
25 IFN-y-induced apoptotic cell death. 

Expression of senescence-associated $-galactosidase 

A biomarker that identifies senescent human cells in culture and in aging skin in vivo has 
recently been reported. This marker, termed senescence-associated p-galactosidase (SA-p-gal), 
30 is expressed by senescent, but not pre-senescent fibroblasts. SA-p-gal was also absent from 
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immortal cells but was induced by genetic manipulations that reversed immortality (Dimri et aL, 
1995). Young (early passage) human WI-38 fibroblasts were SA-p-gal negative, whereas the 
senescent (at population doubling level greater than 52) WI-38 cells were strongly SA-p-gal 
positive, which provided a valid control for the SA-(3-gal assay. The Tc-responsive RB- 
5 reconstituted tumor cell clones were totally SA-p-gal negative in the presence of Tc {i.e., in RB" 
status), and the majority of the tumor cells became SA-p-gal positive after induction of pRB 
expression for four to five days in Tc-free medium. The detection of this senescence-associated 
biomarker in the tumor cells was coincident with the irreversible growth cessation of the tumor 
cell populations (FIG. 3A, FIG. 3B and FIG. 3C). The intensity of the SA-p-gal staining of the 
1 0 induced RB + tumor cells, however, was variable depending on the tumor cell types. 

^ Reexpression of pRB (but not p53) in tumor cells inhibited telomerase activity 

^3 Since telomerase has recently emerged as an attractive candidate for a regulator in 

flj cellular senescence (Linskens et aL, 1995; Klingelhutz et aL, 1996), the effects of pRB and p53 

1|* replacement on the telomerase activity of the host tumor cells were determined. In this 
connection, several long-term stable tumor cell clones with Tc-regulatable wild-type p53 
s expression from the osteosarcoma cell line, Saos-2 were established. A telomeric repeat 
fij amplification protocol (TRAP) assay as recently described (Kim et al 9 1994) was used to 
\I measure telomerase activity in tumor cells before and after induction of pRB (or p53) expression. 

2<6 

Prior to induction of pRB expression, the ^-reconstituted tumor cell clones from all 
three 7£B/p53-defective tumor types examined were positive for telomerase activity, whereas the 
relative telomerase activity was -15 to >100 times lower in the tumor cells after turning on the 
pRB expression as estimated by densitometry of the digitized image. In fact, the telomerase 
25 activity was nearly non-detectable in the pRB-expressing MDA-MB-468 and Saos-2 tumor cells. 
In contrast, although induction of wild-type p53 expression in Saos-2 did result in growth arrest 
of the RB7p53 nu11 tumor cells, the p5 3 -reconstituted Saos-2 tumor clones persistently exhibited 
positive telomerase activity, which was not affected by their p53 status. Thus the differences in 
telomerase activity cannot be explained simply as a difference in cell proliferation. 
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Post senescent crisis of pRB-induced senescent tumor cells after withdrawal of pRB 

The pRB-induced tumor cell senescence was stringently dependent on the continued 
expression of the functional pRB. As shown above, after induction of pRB expression in Tc-free 
5 medium for four or more days, the ^-reconstituted MDA-MB-468, Saos-2, and 5637 tumor 
cells became senescent. When these tumor cells returned to an non-permissive medium for pRB 
expression, however, a large number of tumor cells were observed that lost cell-cell adherence, 
detached from the Petri dishes and died. To further characterize this phenomenon, a combined 
method was employed involving pRB immunocytochemical staining and ( 3 H) thymidine in situ 
1 0 labeling of the tumor cells. 

^ It was found that after adding 0.5 (ig/ml of Tc back to the /^-reconstituted Saos-2 tumor 

|f cell cultures that had been maintained in Tc-free medium for 4 to 5 days, nearly all tumor cells 
pi were depleted of the exogenous pRB and became RB" at day 6. Subsequently, at day 9 to 10, the 
\%z tumor cells resumed DNA synthesis, the majority of which however had strikingly aberrant 
*L] nuclei. They attempted to divide but most died in the process. These tumor cells displayed a 
- phenotype, showing remarkable similarity to postsenescent crisis of the T-antigen-transformed 
|l| human cells in late passage (Stein, 1985). 

2|M In summary, reexpression of functional pRB in ^-defective tumor cells induced growth 

cessation concurrently with inhibition of telomerase activity. The tumor cells irreversibly lost 
mitogen responsiveness, entering a viable Gl-arrested state. They also exhibited pRB-dependent 
SA-p-gal positivity (a senescence-associated biomarker) and resistance to apoptotic cell death. 
Of note, replacement of either wild-type pRB or p53 in the RB7p53 nu11 Saos-2 was able to block 

25 tumor cell growth at the population level, but only pRB induced inhibition of telomerase. 
Furthermore, withdrawal of pRB in pRB-induced senescent tumor cells led to a crisis-like 
phenotype. These observations, taken together, suggest pRB is causally involved in the cellular 
senescence program. These results are the first direct evidence that overexpression of pRB alone 
in a variety of £B-defective tumor cells was sufficient to reverse their immortality and cause 

30 bona fide replicative senescence. Since all three 7?5-defective tumor cell lines examined also 
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have p53 mutations, the pRB-mediated tumor cell senescence apparently do not require 
wild-type p53 function. 

Thus a new link between pRB and telomerase is shown. It is demonstrated, by a 
5 telomeric repeat amplification protocol (TRAP) assay, that reexpression of pRB in i?5-defective 
tumor cells inhibits telomerase activity. Because of the high sensitivity of the polymerase chain 
reaction (PCR™)-based TRAP assay, which detects the enzyme activity in a very small number 
of telomerase positive cells, and the difficulty in obtaining absolutely pure ^-reconstituted cell 
clones, the effectiveness of pRB reexpression on inhibition of telomerase activity in &B-defective 
10 tumor cells was likely even greater than it had been detected by the in vitro assay. 

_ It is also noteworthy that, when maintained in non-permissive conditions for pRB (or 

'42 p53) expression, the pRB -reconstituted Saos-2 clone apparently had much lower telomerase 
flj activity than the p53 -reconstituted Saos-2 clone. The difference implies that, even before 

1% switching-on of the pRB expression in Tc-free medium, there must be low baseline expression of 
pRB from the Tc-responsive promoter in Saos-2 cells (Gossen and Bujard, 1995). The leakiness 
3 of pRB in pRB -reconstituted tumor cells under non-permissive conditions is below the 
«1 immunodetection threshold for pRB protein (Xu et al, 1991b), but it might be sufficient to 
^ inhibit the most telomerase activity. Since the tumor cells lacking telomerase activity likely 

IMi resume telomere decline, this would eventually trigger the intrinsic cellular senescence program 
if it remains intact in the tumor cells. 

* * # 

25 All of the compositions and methods disclosed and claimed herein can be made and 

executed without undue experimentation in light of the present disclosure. While the 
compositions and methods of this invention have been described in terms of preferred 
embodiments, it will be apparent to those of skill in the art that variations may be applied to the 
compositions and methods and in the steps or in the sequence of steps of the methods described 

30 herein without departing from the concept, spirit and scope of the invention. More specifically, it 
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will be apparent that certain agents which are both chemically and physiologically related may be 
substituted for the agents described herein while the same or similar results would be achieved. 
All such similar substitutes and modifications apparent to those skilled in the art are deemed to 
be within the spirit, scope and concept of the invention as defined by the appended claims. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Xu, Hong-Ji 
Hu, Shi-Xue 
Benedict, William F. 
Zhou, Yunli 

(ii) TITLE OF INVENTION: MODIFIED RETINOBLASTOMA TUMOR SUPPRESSOR 
PROTEINS 

(iii) NUMBER OF SEQUENCES: 51 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Arnold, White & Durkee 

(B) STREET: P.O. Box 4433 

(C) CITY: Houston 

(D) STATE: TX 

(E) COUNTRY: USA 

(F) ZIP: 77210-4433 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS -DOS 

(D) SOFTWARE: Patent In Release #1.0, Version #1.3 0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US UNKNOWN 

(B) FILING DATE: Concurrently Herewith 

(C) CLASSIFICATION: UNKNOWN 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/038,118 

(B) FILING DATE: 20-FEB-1997 

(viii) ATTORNEY / AGENT INFORMATION: 

(A) NAME: Hibler, David W. 

(B) REGISTRATION NUMBER: 41,071 

(C) REFERENCE/DOCKET NUMBER: UTXC:506 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 512/418-3000 

(B) TELEFAX: 512/474-7577 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3555 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME / KEY : CDS 

(B) LOCATION: 7. .2790 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 



CGCGTC ATG CCG CCC AAA ACC CCC CGA AAA ACG GCC GCC ACC GCC GCC 48 
Met Pro Pro Lys Thr Pro Arg Lys Thr Ala Ala Thr Ala Ala 
15 10 

OCT GCC GCC GCG GAA CCC CCG GCA CCG CCG CCG CCG CCC CCT CCT GAG 96 
Ala Ala Ala Ala Glu Pro Pro Ala Pro Pro Pro Pro Pro Pro Pro Glu 
15 20 25 30 

GAG GAC CCA GAG CAG GAC AGC GGC CCG GAG GAC CTG CCT CTC GTC AGG 144 
Glu Asp Pro Glu Gin Asp Ser Gly Pro Glu Asp Leu Pro Leu Val Arg 
35 40 45 

CTT GAG TTT GAA GAA ACA GAA GAA CCT GAT TTT ACT GCA TTA TGT CAG 192 
Leu Glu Phe Glu Glu Thr Glu Glu Pro Asp Phe Thr Ala Leu Cys Gin 
50 55 60 

AAA TTA AAG ATA CCA GAT CAT GTC AGA GAG AGA GCT TGG TTA ACT TGG 240 
Lys Leu Lys lie Pro Asp His Val Arg Glu Arg Ala Trp Leu Thr Trp 
65 70 75 

GAG AAA GTT TCA TCT GTG GAT GGA GTA TTG GGA GGT TAT ATT CAA AAG 288 
Glu Lys Val Ser Ser Val Asp Gly Val Leu Gly Gly Tyr He Gin Lys 
80 85 90 

AAA AAG GAA CTG TGG GGA ATC TGT ATC TTT ATT GCA GCA GTT GAC CTA 3 36 

Lys Lys Glu Leu Trp Gly He Cys He Phe He Ala Ala Val Asp Leu 
95 100 105 110 

GAT GAG ATG TCG TTC ACT TTT ACT GAG CTA CAG AAA AAC ATA GAA ATC 384 
Asp Glu Met Ser Phe Thr Phe Thr Glu Leu Gin Lys Asn He Glu He 
115 120 125 

AGT GTC CAT AAA TTC TTT AAC TTA CTA AAA GAA ATT GAT ACC AGT ACC 432 
Ser Val His Lys Phe Phe Asn Leu Leu Lys Glu He Asp Thr Ser Thr 
130 135 140 

AAA GTT GAT AAT GCT ATG TCA AGA CTG TTG AAG AAG TAT GAT GTA TTG 480 
Lys Val Asp Asn Ala Met Ser Arg Leu Leu Lys Lys Tyr Asp Val Leu 
145 150 155 

TTT GCA CTC TTC AGC AAA TTG GAA AGG ACA TGT GAA CTT ATA TAT TTG 528 
Phe Ala Leu Phe Ser Lys Leu Glu Arg Thr Cys Glu Leu He Tyr Leu 
160 165 170 
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ACA CAA CCC AGC AGT TCG ATA TCT ACT GAA ATA AAT TCT GCA TTG GTG 576 
Thr Gin Pro Ser Ser Ser lie Ser Thr Glu lie Asn Ser Ala Leu Val 
175 180 185 190 

CTA AAA GTT TCT TGG ATC ACA TTT TTA TTA GCT AAA GGG GAA GTA TTA 624 
Leu Lys Val Ser Trp lie Thr Phe Leu Leu Ala Lys Gly Glu Val Leu 
195 200 205 

CAA ATG GAA GAT GAT CTG GTG ATT TCA TTT CAG TTA ATG CTA TGT GTC 672 
Gin Met Glu Asp Asp Leu Val lie Ser Phe Gin Leu Met Leu Cys Val 
210 215 220 

CTT GAC TAT TTT ATT AAA CTC TCA CCT CCC ATG TTG CTC AAA GAA CCA 720 
Leu Asp Tyr Phe lie Lys Leu Ser Pro Pro Met Leu Leu Lys Glu Pro 
225 230 235 

TAT AAA ACA GCT GTT ATA CCC ATT AAT GGT TCA CCT CGA ACA CCC AGG 768 
Tyr Lys Thr Ala Val lie Pro lie Asn Gly Ser Pro Arg Thr Pro Arg 
240 245 250 

CGA GGT CAG AAC AGG AGT GCA CGG ATA GCA AAA CAA CTA GAA AAT GAT 816 
Arg Gly Gin Asn Arg Ser Ala Arg lie Ala Lys Gin Leu Glu Asn Asp 
255 260 265 270 

ACA AGA ATT ATT GAA GTT CTC TGT AAA GAA CAT GAA TGT AAT ATA GAT 864 
Thr Arg He He Glu Val Leu Cys Lys Glu His Glu Cys Asn He Asp 
275 280 285 

GAG GTG AAA AAT GTT TAT TTC AAA AAT TTT ATA CCT TTT ATG AAT TCT 912 
Glu Val Lys Asn Val Tyr Phe Lys Asn Phe He Pro Phe Met Asn Ser 
290 295 300 

CTT GGA CTT GTA ACA TCT AAT GGA CTT CCA GAG GTT GAA AAT CTT TCT 960 
Leu Gly Leu Val Thr Ser Asn Gly Leu Pro Glu Val Glu Asn Leu Ser 
305 310 315 

AAA CGA TAC GAA GAA ATT TAT CTT AAA AAT AAA GAT CTA GAT GCA AGA 1008 
Lys Arg Tyr Glu Glu He Tyr Leu Lys Asn Lys Asp Leu Asp Ala Arg 
320 325 330 

TTA TTT TTG GAT CAT GAT AAA ACT CTT CAG ACT GAT TCT ATA GAC AGT 1056 
Leu Phe Leu Asp His Asp Lys Thr Leu Gin Thr Asp Ser He Asp Ser 
335 340 345 350 

TTT GAA ACA CAG AGA ACA CCA CGA AAA AGT AAC CTT GAT GAA GAG GTG 1104 
Phe Glu Thr Gin Arg Thr Pro Arg Lys Ser Asn Leu Asp Glu Glu Val 
355 360 365 

AAT GTA ATT CCT CCA CAC ACT CCA GTT AGG ACT GTT ATG AAC ACT ATC 1152 
Asn Val He Pro Pro His Thr Pro Val Arg Thr Val Met Asn Thr He 
370 375 380 
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CAA CAA TTA ATG ATG ATT TTA AAT TCA GCA AGT GAT CAA CCT TCA GAA 12 00 

Gin Gin Leu Met Met lie Leu Asn Ser Ala Ser Asp Gin Pro Ser Glu 
385 390 395 

AAT CTG ATT TCC TAT TTT AAC AAC TGC ACA GTG AAT CCA AAA GAA AGT 1248 
Asn Leu He Ser Tyr Phe Asn Asn Cys Thr Val Asn Pro Lys Glu Ser 
400 405 410 

ATA CTG AAA AGA GTG AAG GAT ATA GGA TAC ATC TTT AAA GAG AAA TTT 12 96 

He Leu Lys Arg Val Lys Asp He Gly Tyr He Phe Lys Glu Lys Phe 
415 420 425 430 

GCT AAA GCT GTG GGA CAG GGT TGT GTC GAA ATT GGA TCA CAG CGA TAC 1344 
Ala Lys Ala Val Gly Gin Gly Cys Val Glu He Gly Ser Gin Arg Tyr 
435 440 445 

AAA CTT GGA GTT CGC TTG TAT TAC CGA GTA ATG GAA TCC ATG CTT AAA 13 92 

Lys Leu Gly Val Arg Leu Tyr Tyr Arg Val Met Glu Ser Met Leu Lys 
450 455 460 

TCA GAA GAA GAA CGA TTA TCC ATT CAA AAT TTT AGC AAA CTT CTG AAT 144 0 

Ser Glu Glu Glu Arg Leu Ser He Gin Asn Phe Ser Lys Leu Leu Asn 
465 470 475 

GAC AAC ATT TTT CAT ATG TCT TTA TTG GCG TGC GCT CTT GAG GTT GTA 1488 
Asp Asn He Phe His Met Ser Leu Leu Ala Cys Ala Leu Glu Val Val 
480 485 490 

ATG GCC ACA TAT AGC AGA AGT ACA TCT CAG AAT CTT GAT TCT GGA ACA 153 6 

Met Ala Thr Tyr Ser Arg Ser Thr Ser Gin Asn Leu Asp Ser Gly Thr 
495 500 505 510 

GAT TTG TCT TTC CCA TGG ATT CTG AAT GTG CTT AAT TTA AAA GCC TTT 1584 
Asp Leu Ser Phe Pro Trp He Leu Asn Val Leu Asn Leu Lys Ala Phe 
515 520 525 

GAT TTT TAC AAA GTG ATC GAA AGT TTT ATC AAA GCA GAA GGC AAC TTG 1632 
Asp Phe Tyr Lys Val He Glu Ser Phe He Lys Ala Glu Gly Asn Leu 
530 535 540 

ACA AGA GAA ATG ATA AAA CAT TTA GAA CGA TGT GAA CAT CGA ATC ATG 1680 
Thr Arg Glu Met He Lys His Leu Glu Arg Cys Glu His Arg He Met 
545 550 555 

GAA TCC CTT GCA TGG CTC TCA GAT TCA CCT TTA TTT GAT CTT ATT AAA 172 8 

Glu Ser Leu Ala Trp Leu Ser Asp Ser Pro Leu Phe Asp Leu He Lys 
560 565 570 

CAA TCA AAG GAC CGA GAA GGA CCA ACT GAT CAC CTT GAA TCT GCT TGT 1776 
Gin Ser Lys Asp Arg Glu Gly Pro Thr Asp His Leu Glu Ser Ala Cys 
575 580 585 590 
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CCT CTT AAT CTT CCT CTC CAG AAT AAT CAC ACT GCA GCA GAT ATG TAT 1824 
Pro Leu Asn Leu Pro Leu Gin Asn Asn His Thr Ala Ala Asp Met Tyr 
595 600 605 

CTT TCT CCT GTA AGA TCT CCA AAG AAA AAA GGT TCA ACT ACG CGT GTA 1872 
Leu Ser Pro Val Arg Ser Pro Lys Lys Lys Gly Ser Thr Thr Arg Val 
610 615 620 

AAT TCT ACT GCA AAT GCA GAG ACA CAA GCA ACC TCA GCC TTC CAG ACC 192 0 

Asn Ser Thr Ala Asn Ala Glu Thr Gin Ala Thr Ser Ala Phe Gin Thr 
625 630 635 

CAG AAG CCA TTG AAA TCT ACC TCT CTT TCA CTG TTT TAT AAA AAA GTG 1968 
Gin Lys Pro Leu Lys Ser Thr Ser Leu Ser Leu Phe Tyr Lys Lys Val 
640 645 650 

TAT CGG CTA GCC TAT CTC CGG CTA AAT ACA CTT TGT GAA CGC CTT CTG 2016 
Tyr Arg Leu Ala Tyr Leu Arg Leu Asn Thr Leu Cys Glu Arg Leu Leu 
655 660 665 670 

TCT GAG CAC CCA GAA TTA GAA CAT ATC ATC TGG ACC CTT TTC CAG CAC 2 064 

Ser Glu His Pro Glu Leu Glu His lie lie Trp Thr Leu Phe Gin His 
675 680 685 

ACC CTG CAG AAT GAG TAT GAA CTC ATG AGA GAC AGG CAT TTG GAC CAA 2112 
Thr Leu Gin Asn Glu Tyr Glu Leu Met Arg Asp Arg His Leu Asp Gin 
690 695 700 

ATT ATG ATG TGT TCC ATG TAT GGC ATA TGC AAA GTG AAG AAT ATA GAC 216 0 

lie Met Met Cys Ser Met Tyr Gly lie Cys Lys Val Lys Asn lie Asp 
705 710 715 

CTT AAA TTC AAA ATC ATT GTA ACA GCA TAC AAG GAT CTT CCT CAT GCT 22 08 

Leu Lys Phe Lys lie lie Val Thr Ala Tyr Lys Asp Leu Pro His Ala 
720 725 730 

GTT CAG GAG ACA TTC AAA CGT GTT TTG ATC AAA GAA GAG GAG TAT GAT 2256 
Val Gin Glu Thr Phe Lys Arg Val Leu lie Lys Glu Glu Glu Tyr Asp 
735 740 745 750 

TCT ATT ATA GTA TTC TAT AAC TCG GTC TTC ATG CAG AGA CTG AAA ACA 23 04 

Ser lie lie Val Phe Tyr Asn Ser Val Phe Met Gin Arg Leu Lys Thr 
755 760 765 

AAT ATT TTG CAG TAT GCT TCC ACC AGG CCC CCT ACC TTG TCA CCA ATA 2352 
Asn lie Leu Gin Tyr Ala Ser Thr Arg Pro Pro Thr Leu Ser Pro lie 
770 775 780 

CCT CAC ATT CCT CGA AGC CCT TAC AAG TTT CCT AGT TCA CCC TTA CGG 24 00 

Pro His lie Pro Arg Ser Pro Tyr Lys Phe Pro Ser Ser Pro Leu Arg 
785 790 795 
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ATT CCT GGA GGG AAC ATC TAT ATT TCA CCC CTG AAG AGT CCA TAT AAA 2448 
lie Pro Gly Gly Asn lie Tyr lie Ser Pro Leu Lys Ser Pro Tyr Lys 
800 805 810 

ATT TCA GAA GGT CTG CCA ACA CCA ACA AAA ATG ACT CCA AGA TCA AGA 2496 
lie Ser Glu Gly Leu Pro Thr Pro Thr Lys Met Thr Pro Arg Ser Arg 
815 820 825 830 

ATC TTA GTA TCA ATT GGT GAA TCA TTC GGG ACT TCT GAG AAG TTC CAG 2 544 

He Leu Val Ser He Gly Glu Ser Phe Gly Thr Ser Glu Lys Phe Gin 
835 840 845 

AAA ATA AAT CAG ATG GTA TGT AAC AGC GAC CGT GTG CTC AAA AGA AGT 2 592 

Lys He Asn Gin Met Val Cys Asn Ser Asp Arg Val Leu Lys Arg Ser 
850 855 860 

GCT GAA GGA AGC AAC CCT CCT AAA CCA CTG AAA AAA CTA CGC TTT GAT 2640 
Ala Glu Gly Ser Asn Pro Pro Lys Pro Leu Lys Lys Leu Arg Phe Asp 
865 870 875 

ATT GAA GGA TCA GAT GAA GCA GAT GGA AGT AAA CAT CTC CCA GGA GAG 2688 
He Glu Gly Ser Asp Glu Ala Asp Gly Ser Lys His Leu Pro Gly Glu 
880 885 890 

TCC AAA TTT CAG CAG AAA CTG GCA GAA ATG ACT TCT ACT CGA ACA CGA 2 736 

Ser Lys Phe Gin Gin Lys Leu Ala Glu Met Thr Ser Thr Arg Thr Arg 
895 900 905 910 

ATG CAA AAG CAG AAA ATG AAT GAT AGC ATG GAT ACC TCA AAC AAG GAA 2 784 

Met Gin Lys Gin Lys Met Asn Asp Ser Met Asp Thr Ser Asn Lys Glu 
915 920 925 

GAG AAA TGAGGATCTC AGGACCTTGG TGGACACTGT GTACACCTCT GGATTCATTG 2840 
Glu Lys 



TCTCTCACAG ATGTGACTGT ATAACTTTCC CAGGTTCTGT TTATGGCCAC ATTTAATATC 2 900 

TTCAGCTCTT TTTGTGGATA TAAAATGTGC AGATGCAATT GTTTGGGTGA TTCCTAAGCC 2 960 

ACTTGAAATG TTAGTCATTG TTATTTATAC AAGATTGAAA ATCTTGTGTA AATCCTGCCA 3 020 

TTTAAAAAGT TGTAGCAGAT TGTTTCCTCT TCCAAAGTAA AATTGCTGTG CTTTATGGAT 308 0 

AGTAAGAATG GCCCTAGAGT GGGAGTCCTG ATAACCCAGG CCTGTCTGAC TACTTTGCCT 3140 

TCTTTTGTAG CATATAGGTG ATGTTTGCTC TTGTTTTTAT TAATTTATAT GTATATTTTT 32 00 

TTAATTTAAC ATGAAC AC C C TTAGAAAATG TGTCCTATCT ATCTTCCAAA TGCAATTTGA 3260 

TTGACTGCCC ATTCACCAAA ATTATCCTGA ACTCTTCTGC AAAAATGGAT ATTATTAGAA 332 0 

ATTAGAAAAA AATTACTAAT TTTACACATT AGATTTTATT TTACTATTGG AATCTGATAT 338 0 
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ACTGTGTGCT TGTTTTATAA AATTTTGCTT TTAATTAAAT AAAAGCTGGA AGCAAAGTAT 



AAC CATATGA TACTATCATA CTACTGAAAC AGATTTCATA CCTCAGAATG TAAAAGAACT 



TACTGATTAT TTTCTTCATC CAACTTATGT TTTTAAATGA GGATTATTGA TAGTC 



3440 
3500 
3555 



(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 928 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met Pro Pro Lys Thr Pro Arg Lys Thr Ala Ala Thr Ala Ala Ala Ala 
15 10 15 

Ala Ala Glu Pro Pro Ala Pro Pro Pro Pro Pro Pro Pro Glu Glu Asp 
20 25 30 

Pro Glu Gin Asp Ser Gly Pro Glu Asp Leu Pro Leu Val Arg Leu Glu 
35 40 45 

Phe Glu Glu Thr Glu Glu Pro Asp Phe Thr Ala Leu Cys Gin Lys Leu 
50 55 60 

Lys lie Pro Asp His Val Arg Glu Arg Ala Trp Leu Thr Trp Glu Lys 
65 70 75 80 

Val Ser Ser Val Asp Gly Val Leu Gly Gly Tyr lie Gin Lys Lys Lys 
85 90 95 

Glu Leu Trp Gly lie Cys lie Phe lie Ala Ala Val Asp Leu Asp Glu 
100 105 110 

Met Ser Phe Thr Phe Thr Glu Leu Gin Lys Asn He Glu He Ser Val 

115 120 125 

His Lys Phe Phe Asn Leu Leu Lys Glu He Asp Thr Ser Thr Lys Val 
130 135 140 

Asp Asn Ala Met Ser Arg Leu Leu Lys Lys Tyr Asp Val Leu Phe Ala 
145 150 155 160 

Leu Phe Ser Lys Leu Glu Arg Thr Cys Glu Leu He Tyr Leu Thr Gin 
165 170 175 

Pro Ser Ser Ser He Ser Thr Glu He Asn Ser Ala Leu Val Leu Lys 
180 185 190 
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Val Ser Trp lie Thr Phe Leu Leu Ala Lys Gly Glu Val Leu Gin Met 
195 200 205 



Glu Asp Asp Leu Val lie Ser Phe Gin Leu Met Leu Cys Val Leu Asp 
210 215 220 

Tyr Phe lie Lys Leu Ser Pro Pro Met Leu Leu Lys Glu Pro Tyr Lys 
225 230 235 240 

Thr Ala Val lie Pro lie Asn Gly Ser Pro Arg Thr Pro Arg Arg Gly 
245 250 255 

Gin Asn Arg Ser Ala Arg lie Ala Lys Gin Leu Glu Asn Asp Thr Arg 
260 265 270 

lie lie Glu Val Leu Cys Lys Glu His Glu Cys Asn lie Asp Glu Val 
275 280 285 

Lys Asn Val Tyr Phe Lys Asn Phe lie Pro Phe Met Asn Ser Leu Gly 
290 295 300 

Leu Val Thr Ser Asn Gly Leu Pro Glu Val Glu Asn Leu Ser Lys Arg 
305 310 315 320 

Tyr Glu Glu lie Tyr Leu Lys Asn Lys Asp Leu Asp Ala Arg Leu Phe 
325 330 335 

Leu Asp His Asp Lys Thr Leu Gin Thr Asp Ser lie Asp Ser Phe Glu 
340 345 350 

Thr Gin Arg Thr Pro Arg Lys Ser Asn Leu Asp Glu Glu Val Asn Val 
355 360 365 

lie Pro Pro His Thr Pro Val Arg Thr Val Met Asn Thr lie Gin Gin 
370 375 380 

Leu Met Met lie Leu Asn Ser Ala Ser Asp Gin Pro Ser Glu Asn Leu 
385 390 395 400 

lie Ser Tyr Phe Asn Asn Cys Thr Val Asn Pro Lys Glu Ser lie Leu 
405 410 415 

Lys Arg Val Lys Asp lie Gly Tyr lie Phe Lys Glu Lys Phe Ala Lys 
420 425 430 

Ala Val Gly Gin Gly Cys Val Glu lie Gly Ser Gin Arg Tyr Lys Leu 
435 440 445 

Gly Val Arg Leu Tyr Tyr Arg Val Met Glu Ser Met Leu Lys Ser Glu 
450 455 460 

Glu Glu Arg Leu Ser lie Gin Asn Phe Ser Lys Leu Leu Asn Asp Asn 
465 470 475 480 
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lie Phe His Met Ser Leu Leu Ala Cys Ala Leu Glu Val Val Met Ala 
485 490 495 



Thr Tyr Ser Arg Ser Thr Ser Gin Asn Leu Asp Ser Gly Thr Asp Leu 
500 505 510 

Ser Phe Pro Trp lie Leu Asn Val Leu Asn Leu Lys Ala Phe Asp Phe 
515 520 525 

Tyr Lys Val lie Glu Ser Phe lie Lys Ala Glu Gly Asn Leu Thr Arg 
530 535 540 

Glu Met lie Lys His Leu Glu Arg Cys Glu His Arg lie Met Glu Ser 
545 550 555 560 

Leu Ala Trp Leu Ser Asp Ser Pro Leu Phe Asp Leu lie Lys Gin Ser 
565 570 575 

Lys Asp Arg Glu Gly Pro Thr Asp His Leu Glu Ser Ala Cys Pro Leu 
580 585 590 

Asn Leu Pro Leu Gin Asn Asn His Thr Ala Ala Asp Met Tyr Leu Ser 
595 600 605 

Pro Val Arg Ser Pro Lys Lys Lys Gly Ser Thr Thr Arg Val Asn Ser 
610 615 620 

Thr Ala Asn Ala Glu Thr Gin Ala Thr Ser Ala Phe Gin Thr Gin Lys 
625 630 635 640 

Pro Leu Lys Ser Thr Ser Leu Ser Leu Phe Tyr Lys Lys Val Tyr Arg 
645 650 655 

Leu Ala Tyr Leu Arg Leu Asn Thr Leu Cys Glu Arg Leu Leu Ser Glu 
660 665 670 

His Pro Glu Leu Glu His lie lie Trp Thr Leu Phe Gin His Thr Leu 
675 680 685 

Gin Asn Glu Tyr Glu Leu Met Arg Asp Arg His Leu Asp Gin lie Met 
690 695 700 

Met Cys Ser Met Tyr Gly lie Cys Lys Val Lys Asn lie Asp Leu Lys 
705 710 715 720 

Phe Lys lie lie Val Thr Ala Tyr Lys Asp Leu Pro His Ala Val Gin 
725 730 735 

Glu Thr Phe Lys Arg Val Leu lie Lys Glu Glu Glu Tyr Asp Ser lie 
740 745 750 

lie Val Phe Tyr Asn Ser Val Phe Met Gin Arg Leu Lys Thr Asn lie 
755 760 765 
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Leu Gin Tyr Ala 
770 

lie Pro Arg Ser 
785 

Gly Gly Asn He 



Glu Gly Leu Pro 
820 

Val Ser He Gly 
835 

Asn Gin Met Val 
850 

Gly Ser Asn Pro 
865 

Gly Ser Asp Glu 



Phe Gin Gin Lys 
900 

Lys Gin Lys Met 
915 



Ser Thr Arg Pro 
775 

Pro Tyr Lys Phe 
790 

Tyr He Ser Pro 
805 

Thr Pro Thr Lys 



Glu Ser Phe Gly 
840 

Cys Asn Ser Asp 
855 

Pro Lys Pro Leu 
870 

Ala Asp Gly Ser 
885 

Leu Ala Glu Met 



Asn Asp Ser Met 
920 



Pro Thr Leu Ser 
780 

Pro Ser Ser Pro 
795 

Leu Lys Ser Pro 
810 

Met Thr Pro Arg 
825 

Thr Ser Glu Lys 



Arg Val Leu Lys 
860 

Lys Lys Leu Arg 
875 

Lys His Leu Pro 
890 

Thr Ser Thr Arg 
905 

Asp Thr Ser Asn 



Pro He Pro His 



Leu Arg He Pro 
800 

Tyr Lys He Ser 
815 

Ser Arg He Leu 
830 

Phe Gin Lys He 
845 

Arg Ser Ala Glu 



Phe Asp He Glu 
880 

Gly Glu Ser Lys 
895 

Thr Arg Met Gin 
910 

Lys Glu Glu Lys 
925 



(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3218 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME / KEY : CDS 

(B) LOCATION: 7.. 2454 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

GCCGTC ATG TCG TTC ACT TTT ACT GAG CTA CAG AAA AAC ATA GAA ATC 48 
Met Ser Phe Thr Phe Thr Glu Leu Gin Lys Asn He Glu He 
15 10 

AGT GTC CAT AAA TTC TTT AAC TTA CTA AAA GAA ATT GAT ACC AGT ACC 96 
Ser Val His Lys Phe Phe Asn Leu Leu Lys Glu He Asp Thr Ser Thr 
15 20 25 30 
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AAA GTT GAT AAT GCT ATG TCA AGA CTG TTG AAG AAG TAT GAT GTA TTG 144 
Lys Val Asp Asn Ala Met Ser Arg Leu Leu Lys Lys Tyr Asp Val Leu 
35 40 45 

TTT GCA CTC TTC AGC AAA TTG GAA AGG ACA TGT GAA CTT ATA TAT TTG 192 
Phe Ala Leu Phe Ser Lys Leu Glu Arg Thr Cys Glu Leu lie Tyr Leu 
50 55 60 

ACA CAA CCC AGC AGT TCG ATA TCT ACT GAA ATA AAT TCT GCA TTG GTG 240 
Thr Gin Pro Ser Ser Ser lie Ser Thr Glu lie Asn Ser Ala Leu Val 
65 70 75 

CTA AAA GTT TCT TGG ATC ACA TTT TTA TTA GCT AAA GGG GAA GTA TTA 288 
Leu Lys Val Ser Trp lie Thr Phe Leu Leu Ala Lys Gly Glu Val Leu 
80 85 90 

CAA ATG GAA GAT GAT CTG GTG ATT TCA TTT CAG TTA ATG CTA TGT GTC 33 6 

Gin Met Glu Asp Asp Leu Val lie Ser Phe Gin Leu Met Leu Cys Val 
95 100 105 110 

CTT GAC TAT TTT ATT AAA CTC TCA CCT CCC ATG TTG CTC AAA GAA CCA 384 
Leu Asp Tyr Phe lie Lys Leu Ser Pro Pro Met Leu Leu Lys Glu Pro 
115 120 125 

TAT AAA ACA GCT GTT ATA CCC ATT AAT GGT TCA CCT CGA ACA CCC AGG 43 2 

Tyr Lys Thr Ala Val lie Pro lie Asn Gly Ser Pro Arg Thr Pro Arg 
130 135 140 

CGA GGT CAG AAC AGG AGT GCA CGG ATA GCA AAA CAA CTA GAA AAT GAT 480 
Arg Gly Gin Asn Arg Ser Ala Arg lie Ala Lys Gin Leu Glu Asn Asp 
145 150 155 

ACA AGA ATT ATT GAA GTT CTC TGT AAA GAA CAT GAA TGT AAT ATA GAT 52 8 

Thr Arg lie lie Glu Val Leu Cys Lys Glu His Glu Cys Asn lie Asp 
160 165 170 

GAG GTG AAA AAT GTT TAT TTC AAA AAT TTT ATA CCT TTT ATG AAT TCT 576 
Glu Val Lys Asn Val Tyr Phe Lys Asn Phe lie Pro Phe Met Asn Ser 
175 180 185 190 

CTT GGA CTT GTA ACA TCT AAT GGA CTT CCA GAG GTT GAA AAT CTT TCT 624 
Leu Gly Leu Val Thr Ser Asn Gly Leu Pro Glu Val Glu Asn Leu Ser 
195 200 205 

AAA CGA TAC GAA GAA ATT TAT CTT AAA AAT AAA GAT CTA GAT GCA AGA 672 
Lys Arg Tyr Glu Glu lie Tyr Leu Lys Asn Lys Asp Leu Asp Ala Arg 
210 215 220 

TTA TTT TTG GAT CAT GAT AAA ACT CTT CAG ACT GAT TCT ATA GAC AGT 72 0 

Leu Phe Leu Asp His Asp Lys Thr Leu Gin Thr Asp Ser lie Asp Ser 
225 230 235 
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TTT GAA ACA CAG AGA ACA CCA CGA AAA AGT AAC CTT GAT GAA GAG GTG 768 
Phe Glu Thr Gin Arg Thr Pro Arg Lys Ser Asn Leu Asp Glu Glu Val 
240 245 250 

AAT GTA ATT CCT CCA CAC ACT CCA GTT AGG ACT GTT ATG AAC ACT ATC 816 
Asn Val lie Pro Pro His Thr Pro Val Arg Thr Val Met Asn Thr lie 
255 260 265 270 

CAA CAA TTA ATG ATG ATT TTA AAT TCA GCA AGT GAT CAA CCT TCA GAA 864 
Gin Gin Leu Met Met lie Leu Asn Ser Ala Ser Asp Gin Pro Ser Glu 
275 280 285 

AAT CTG ATT TCC TAT TTT AAC AAC TGC ACA GTG AAT CCA AAA GAA AGT 912 
Asn Leu lie Ser Tyr Phe Asn Asn Cys Thr Val Asn Pro Lys Glu Ser 
290 295 300 

ATA CTG AAA AGA GTG AAG GAT ATA GGA TAC ATC TTT AAA GAG AAA TTT 960 
lie Leu Lys Arg Val Lys Asp He Gly Tyr He Phe Lys Glu Lys Phe 
305 310 315 

GCT AAA GCT GTG GGA CAG GGT TGT GTC GAA ATT GGA TCA CAG CGA TAC 1008 
Ala Lys Ala Val Gly Gin Gly Cys Val Glu He Gly Ser Gin Arg Tyr 
320 325 330 

AAA CTT GGA GTT CGC TTG TAT TAC CGA GTA ATG GAA TCC ATG CTT AAA 1056 
Lys Leu Gly Val Arg Leu Tyr Tyr Arg Val Met Glu Ser Met Leu Lys 
335 340 345 350 

TCA GAA GAA GAA CGA TTA TCC ATT CAA AAT TTT AGC AAA CTT CTG AAT 1104 
Ser Glu Glu Glu Arg Leu Ser He Gin Asn Phe Ser Lys Leu Leu Asn 
355 360 365 

GAC AAC ATT TTT CAT ATG TCT TTA TTG GCG TGC GCT CTT GAG GTT GTA 1152 
Asp Asn He Phe His Met Ser Leu Leu Ala Cys Ala Leu Glu Val Val 
370 375 380 

ATG GCC ACA TAT AGC AGA AGT ACA TCT CAG AAT CTT GAT TCT GGA ACA 12 0 0 

Met Ala Thr Tyr Ser Arg Ser Thr Ser Gin Asn Leu Asp Ser Gly Thr 
385 390 395 

GAT TTG TCT TTC CCA TGG ATT CTG AAT GTG CTT AAT TTA AAA GCC TTT 1248 
Asp Leu Ser Phe Pro Trp He Leu Asn Val Leu Asn Leu Lys Ala Phe 
400 405 410 

GAT TTT TAC AAA GTG ATC GAA AGT TTT ATC AAA GCA GAA GGC AAC TTG 1296 
Asp Phe Tyr Lys Val He Glu Ser Phe He Lys Ala Glu Gly Asn Leu 
415 420 425 430 

ACA AGA GAA ATG ATA AAA CAT TTA GAA CGA TGT GAA CAT CGA ATC ATG 1344 
Thr Arg Glu Met He Lys His Leu Glu Arg Cys Glu His Arg He Met 
435 440 445 
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GAA TCC CTT GCA TGG CTC TCA GAT TCA CCT TTA TTT GAT CTT ATT AAA 1392 
Glu Ser Leu Ala Trp Leu Ser Asp Ser Pro Leu Phe Asp Leu lie Lys 
450 455 460 

CAA TCA AAG GAC CGA GAA GGA CCA ACT GAT CAC CTT GAA TCT GCT TGT 1440 
Gin Ser Lys Asp Arg Glu Gly Pro Thr Asp His Leu Glu Ser Ala Cys 
465 470 475 

CCT CTT AAT CTT CCT CTC CAG AAT AAT CAC ACT GCA GCA GAT ATG TAT 1488 
Pro Leu Asn Leu Pro Leu Gin Asn Asn His Thr Ala Ala Asp Met Tyr 
480 485 490 

CTT TCT CCT GTA AGA TCT CCA AAG AAA AAA GGT TCA ACT ACG CGT GTA 1536 
Leu Ser Pro Val Arg Ser Pro Lys Lys Lys Gly Ser Thr Thr Arg Val 
495 500 505 510 

AAT TCT ACT GCA AAT GCA GAG ACA CAA GCA ACC TCA GCC TTC CAG ACC 1584 
Asn Ser Thr Ala Asn Ala Glu Thr Gin Ala Thr Ser Ala Phe Gin Thr 
515 520 525 

CAG AAG CCA TTG AAA TCT ACC TCT CTT TCA CTG TTT TAT AAA AAA GTG 1632 
Gin Lys Pro Leu Lys Ser Thr Ser Leu Ser Leu Phe Tyr Lys Lys Val 
530 535 540 

TAT CGG CTA GCC TAT CTC CGG CTA AAT ACA CTT TGT GAA CGC CTT CTG 1680 
Tyr Arg Leu Ala Tyr Leu Arg Leu Asn Thr Leu Cys Glu Arg Leu Leu 
545 550 555 

TCT GAG CAC CCA GAA TTA GAA CAT ATC ATC TGG ACC CTT TTC CAG CAC 172 8 

Ser Glu His Pro Glu Leu Glu His lie lie Trp Thr Leu Phe Gin His 
560 565 570 

ACC CTG CAG AAT GAG TAT GAA CTC ATG AGA GAC AGG CAT TTG GAC CAA 1776 
Thr Leu Gin Asn Glu Tyr Glu Leu Met Arg Asp Arg His Leu Asp Gin 
575 580 585 590 

ATT ATG ATG TGT TCC ATG TAT GGC ATA TGC AAA GTG AAG AAT ATA GAC 1824 
He Met Met Cys Ser Met Tyr Gly He Cys Lys Val Lys Asn He Asp 
595 600 605 

CTT AAA TTC AAA ATC ATT GTA ACA GCA TAC AAG GAT CTT CCT CAT GCT 1872 
Leu Lys Phe Lys He He Val Thr Ala Tyr Lys Asp Leu Pro His Ala 
610 615 620 

GTT CAG GAG ACA TTC AAA CGT GTT TTG ATC AAA GAA GAG GAG TAT GAT 192 0 

Val Gin Glu Thr Phe Lys Arg Val Leu He Lys Glu Glu Glu Tyr Asp 
625 630 635 

TCT ATT ATA GTA TTC TAT AAC TCG GTC TTC ATG CAG AGA CTG AAA ACA 1968 
Ser He He Val Phe Tyr Asn Ser Val Phe Met Gin Arg Leu Lys Thr 
640 645 650 
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AAT ATT TTG CAG TAT GCT TCC ACC AGG CCC CCT ACC TTG TCA CCA ATA 2 016 

Asn lie Leu Gin Tyr Ala Ser Thr Arg Pro Pro Thr Leu Ser Pro lie 
655 660 665 670 

CCT CAC ATT CCT CGA AGC CCT TAC AAG TTT CCT AGT TCA CCC TTA CGG 2 064 

Pro His lie Pro Arg Ser Pro Tyr Lys Phe Pro Ser Ser Pro Leu Arg 
675 680 685 

ATT CCT GGA GGG AAC ATC TAT ATT TCA CCC CTG AAG AGT CCA TAT AAA 2112 
lie Pro Gly Gly Asn lie Tyr lie Ser Pro Leu Lys Ser Pro Tyr Lys 
690 695 700 

ATT TCA GAA GGT CTG CCA ACA CCA ACA AAA ATG ACT CCA AGA TCA AGA 2160 
lie Ser Glu Gly Leu Pro Thr Pro Thr Lys Met Thr Pro Arg Ser Arg 
705 710 715 

ATC TTA GTA TCA ATT GGT GAA TCA TTC GGG ACT TCT GAG AAG TTC CAG 22 08 

lie Leu Val Ser lie Gly Glu Ser Phe Gly Thr Ser Glu Lys Phe Gin 
720 725 730 

AAA ATA AAT CAG ATG GTA TGT AAC AGC GAC CGT GTG CTC AAA AGA AGT 22 56 

Lys lie Asn Gin Met Val Cys Asn Ser Asp Arg Val Leu Lys Arg Ser 
735 740 745 750 

GCT GAA GGA AGC AAC CCT CCT AAA CCA CTG AAA AAA CTA CGC TTT GAT 23 04 

Ala Glu Gly Ser Asn Pro Pro Lys Pro Leu Lys Lys Leu Arg Phe Asp 
755 760 765 

ATT GAA GGA TCA GAT GAA GCA GAT GGA AGT AAA CAT CTC CCA GGA GAG 2352 
lie Glu Gly Ser Asp Glu Ala Asp Gly Ser Lys His Leu Pro Gly Glu 
770 775 780 

TCC AAA TTT CAG CAG AAA CTG GCA GAA ATG ACT TCT ACT CGA ACA CGA 24 00 

Ser Lys Phe Gin Gin Lys Leu Ala Glu Met Thr Ser Thr Arg Thr Arg 
785 790 795 

ATG CAA AAG CAG AAA ATG AAT GAT AGC ATG GAT ACC TCA AAC AAG GAA 2448 
Met Gin Lys Gin Lys Met Asn Asp Ser Met Asp Thr Ser Asn Lys Glu 
800 805 810 

GAG AAA TGAGGATCTC AGGACCTTGG TGGACACTGT GTACACCTCT GGATTCATTG 2 504 

Glu Lys 

815 

TCTCTCACAG ATGTGACTGT ATAACTTTCC CAGGTTCTGT TTATGGCCAC ATTTAATATC 2 564 

TTCAGCTCTT TTTGTGGATA TAAAATGTGC AGATGCAATT GTTTGGGTGA TTCCTAAGCC 2624 

ACTTGAAATG TTAGTCATTG TTATTTATAC AAGATTGAAA ATCTTGTGTA AATCCTGCCA 2684 

TTTAAAAAGT TGTAGCAGAT TGTTTCCTCT TCCAAAGTAA AATTGCTGTG CTTTATGGAT 2 744 

AGTAAGAATG GCCCTAGAGT GGGAGTCCTG ATAACCCAGG CCTGTCTGAC TACTTTGCCT 2804 
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TCTTTTGTAG CATATAGGTG ATGTTTGCTC TTGTTTTTAT TAATTTATAT GTATATTTTT 2864 

TTAATTTAAC ATGAACACCC TTAGAAAATG TGTCCTATCT ATCTTCCAAA TGCAATTTGA 2 924 

TTGACTGCCC ATTCACCAAA ATTATCCTGA ACTCTTCTGC AAAAATGGAT ATTATTAGAA 2 984 

ATTAGAAAAA AATTACTAAT TTTACACATT AGATTTTATT TTACTATTGG AATCTGATAT 3 044 

ACTGTGTGCT TGTTTTATAA AATTTTGCTT TTAATTAAAT AAAAGCTGGA AGCAAAGTAT 3104 

AAC CATATGA TACTATCATA CTACTGAAAC AGATTTCATA CCTCAGAATG TAAAAGAACT 3164 

TACTGATTAT TTTCTTCATC CAACTTATGT TTTTAAATGA GGATTATTGA TAGT 3218 



(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 816 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Met Ser Phe Thr Phe Thr Glu Leu Gin Lys Asn He Glu He Ser Val 
15 10 15 

His Lys Phe Phe Asn Leu Leu Lys Glu He Asp Thr Ser Thr Lys Val 
20 25 30 

Asp Asn Ala Met Ser Arg Leu Leu Lys Lys Tyr Asp Val Leu Phe Ala 
35 40 45 

Leu Phe Ser Lys Leu Glu Arg Thr Cys Glu Leu He Tyr Leu Thr Gin 
50 55 60 

Pro Ser Ser Ser He Ser Thr Glu He Asn Ser Ala Leu Val Leu Lys 
65 70 75 80 

Val Ser Trp He Thr Phe Leu Leu Ala Lys Gly Glu Val Leu Gin Met 
85 90 95 

Glu Asp Asp Leu Val He Ser Phe Gin Leu Met Leu Cys Val Leu Asp 
100 105 110 

Tyr Phe He Lys Leu Ser Pro Pro Met Leu Leu Lys Glu Pro Tyr Lys 
115 120 125 

Thr Ala Val He Pro He Asn Gly Ser Pro Arg Thr Pro Arg Arg Gly 
130 135 140 
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Gin Asn Arg Ser Ala Arg He Ala Lys Gin Leu Glu Asn Asp Thr Arg 
145 150 155 160 



He He Glu Val Leu Cys Lys Glu His Glu Cys Asn He Asp Glu Val 
165 170 175 

Lys Asn Val Tyr Phe Lys Asn Phe He Pro Phe Met Asn Ser Leu Gly 
180 185 190 

Leu Val Thr Ser Asn Gly Leu Pro Glu Val Glu Asn Leu Ser Lys Arg 
195 200 205 

Tyr Glu Glu He Tyr Leu Lys Asn Lys Asp Leu Asp Ala Arg Leu Phe 
210 215 220 

Leu Asp His Asp Lys Thr Leu Gin Thr Asp Ser He Asp Ser Phe Glu 
225 230 235 240 

Thr Gin Arg Thr Pro Arg Lys Ser Asn Leu Asp Glu Glu Val Asn Val 
245 250 255 

He Pro Pro His Thr Pro Val Arg Thr Val Met Asn Thr He Gin Gin 
260 265 270 

Leu Met Met He Leu Asn Ser Ala Ser Asp Gin Pro Ser Glu Asn Leu 
275 280 285 

He Ser Tyr Phe Asn Asn Cys Thr Val Asn Pro Lys Glu Ser He Leu 
290 295 300 

Lys Arg Val Lys Asp He Gly Tyr He Phe Lys Glu Lys Phe Ala Lys 
305 310 315 320 

Ala Val Gly Gin Gly Cys Val Glu He Gly Ser Gin Arg Tyr Lys Leu 
325 330 335 

Gly Val Arg Leu Tyr Tyr Arg Val Met Glu Ser Met Leu Lys Ser Glu 
340 345 350 

Glu Glu Arg Leu Ser He Gin Asn Phe Ser Lys Leu Leu Asn Asp Asn 
355 360 365 

He Phe His Met Ser Leu Leu Ala Cys Ala Leu Glu Val Val Met Ala 
370 375 380 

Thr Tyr Ser Arg Ser Thr Ser Gin Asn Leu Asp Ser Gly Thr Asp Leu 
385 390 395 400 

Ser Phe Pro Trp He Leu Asn Val Leu Asn Leu Lys Ala Phe Asp Phe 
405 410 415 

Tyr Lys Val He Glu Ser Phe He Lys Ala Glu Gly Asn Leu Thr Arg 
420 425 430 
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Glu Met He Lys His Leu Glu Arg Cys Glu His Arg He Met Glu Ser 
435 440 445 



Leu Ala Trp Leu Ser Asp Ser Pro Leu Phe Asp Leu He Lys Gin Ser 
450 455 460 

Lys Asp Arg Glu Gly Pro Thr Asp His Leu Glu Ser Ala Cys Pro Leu 
465 470 475 480 

Asn Leu Pro Leu Gin Asn Asn His Thr Ala Ala Asp Met Tyr Leu Ser 
485 490 495 

Pro Val Arg Ser Pro Lys Lys Lys Gly Ser Thr Thr Arg Val Asn Ser 
500 505 510 

Thr Ala Asn Ala Glu Thr Gin Ala Thr Ser Ala Phe Gin Thr Gin Lys 
515 520 525 

Pro Leu Lys Ser Thr Ser Leu Ser Leu Phe Tyr Lys Lys Val Tyr Arg 
530 535 540 

Leu Ala Tyr Leu Arg Leu Asn Thr Leu Cys Glu Arg Leu Leu Ser Glu 
545 550 555 560 

His Pro Glu Leu Glu His He He Trp Thr Leu Phe Gin His Thr Leu 
565 570 575 

Gin Asn Glu Tyr Glu Leu Met Arg Asp Arg His Leu Asp Gin He Met 
580 585 590 

Met Cys Ser Met Tyr Gly He Cys Lys Val Lys Asn He Asp Leu Lys 
595 600 605 

Phe Lys He He Val Thr Ala Tyr Lys Asp Leu Pro His Ala Val Gin 
610 615 620 

Glu Thr Phe Lys Arg Val Leu He Lys Glu Glu Glu Tyr Asp Ser He 
625 630 635 640 

He Val Phe Tyr Asn Ser Val Phe Met Gin Arg Leu Lys Thr Asn He 
645 650 655 

Leu Gin Tyr Ala Ser Thr Arg Pro Pro Thr Leu Ser Pro He Pro His 
660 665 670 

He Pro Arg Ser Pro Tyr Lys Phe Pro Ser Ser Pro Leu Arg He Pro 
675 680 685 

Gly Gly Asn He Tyr He Ser Pro Leu Lys Ser Pro Tyr Lys He Ser 
690 695 700 

Glu Gly Leu Pro Thr Pro Thr Lys Met Thr Pro Arg Ser Arg He Leu 
705 710 715 720 
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Val Ser He Gly Glu Ser Phe Gly Thr Ser Glu Lys Phe Gin Lys 
725 730 735 



He 



Asn Gin Met Val Cys Asn Ser Asp Arg Val Leu Lys Arg Ser Ala Glu 
740 745 750 

Gly Ser Asn Pro Pro Lys Pro Leu Lys Lys Leu Arg Phe Asp He Glu 
755 760 765 

Gly Ser Asp Glu Ala Asp Gly Ser Lys His Leu Pro Gly Glu Ser Lys 
770 775 780 

Phe Gin Gin Lys Leu Ala Glu Met Thr Ser Thr Arg Thr Arg Met Gin 
785 790 795 800 

Lys Gin Lys Met Asn Asp Ser Met Asp Thr Ser Asn Lys Glu Glu Lys 
805 810 815 



(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 285 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

CTCGAGCAAT GGGCGTGATA GCGGTTTGAC TCACGGGGAT TTCCAAGTCT CCACCCCATT 6 0 

GACGTCAATG GGAGTTTGTT TTGGCACCAA AATCAACGGG ACTTTCCAAA ATGTCGTAAC 12 0 

AACTCCGCCC CATTGACGCA AATGGGCGGT AGGCGTGTAC GGTGGGAGGT CTATATAAGC 180 

AGAGCTCGTT TAGTGAACCG TCAGATCGCC TGGAGACGCC ATCCACGCTG TTTTGACCTC 240 

CATAGAAGAC ACCGGGACCG ATCCAGCCTC CGCGGCCGCG AATTC 285 



(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 
CCGCTCGAGC AATGGGCGTG GATAGCGG 28 



(2) INFORMATION FOR SEQ ID NO : 7 : 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 
CCGCTCGAGC ACCAAAATCA ACGGGA 



(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 
CCGCTCGAGC AACTCCGCCC CATTGAC 



(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
TAGACATATG AATTCGCGGC C 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 
CTAGAATTCG CTGTCTGCG 



(2) INFORMATION FOR SEQ ID NO: 11: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 
GCTCTAGATG CAGTTGGACC TGGGAG 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 12 
CCCAAGCTTG CCGCCATGTC GTTCACTTTT AC 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 2 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 
GTCCAAGAGA ATTCATAAAA GG 



(2) INFORMATION FOR SEQ ID NO : 14 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 14 
CCCAAGCTTG CCGCCATGGA GCAGGACAGC GGCCCGGAC 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 
CCCAAGCTTG CCGCCATGGA TTTTACTGCA TTATGTCAG 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 
CCCAAGCTTG CCGCCATGGA GAAAGTTTCA TCTTGTGAT 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 
CCCAAGCTTG CCGCCATGCT GTGGGGAATC TGTATCTTT 



(2) INFORMATION FOR SEQ ID NO : 18 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 
CCCAAGCTTG CCGCCATGTC AAGACTGTTG AAGAAG 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19 
GCGCCTGAGG AC CTAGATGA GATGTCGTTC 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20 
GCGGTTAACC CTAGATGAGA TGTCGTTCAC T 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21 
CCCAAGCTTG CCGTCATGCC GCCCAAAACC CCCCGA 



(2) INFORMATION FOR SEQ ID NO:22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 
CTCACCTAGG TCAACTGCTG CAAT 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 
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GTTGACCTAG GTGATATGTC GTTC 



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 
GCGCCTAGGA TCTACTGAAA TAAATTCTGC A 



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25 
CCCGATATCA ACTGCTGGGT TGTGTCAAAT A 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26 
CCCGAATTCG TTTTATATGG TTCTTTGAGC AA 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME /KEY: modif ied_base 

(B) LOCATION: 4.. 5 
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(D) OTHER INFORMATION: /note= "R=A or G" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
GCCRCCAUGG 



10 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3455 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 7. .2691 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 

GCCGTC ATG CAG GAC AGC GGC CCG GAG GAC CTG CCT CTC GTC AGG CTT 4 8 

Met Gin Asp Ser Gly Pro Glu Asp Leu Pro Leu Val Arg Leu 
15 10 

GAG TTT GAA GAA ACA GAA GAA CCT GAT TTT ACT GCA TTA TGT CAG AAA 96 
Glu Phe Glu Glu Thr Glu Glu Pro Asp Phe Thr Ala Leu Cys Gin Lys 
15 20 25 30 

TTA AAG ATA CCA GAT CAT GTC AGA GAG AGA GCT TGG TTA ACT TGG GAG 144 
Leu Lys lie Pro Asp His Val Arg Glu Arg Ala Trp Leu Thr Trp Glu 
35 40 45 

AAA GTT TCA TCT GTG GAT GGA GTA TTG GGA GGT TAT ATT CAA AAG AAA 192 
Lys Val Ser Ser Val Asp Gly Val Leu Gly Gly Tyr lie Gin Lys Lys 
50 55 60 

AAG GAA CTG TGG GGA ATC TGT ATC TTT ATT GCA GCA GTT GAC CTA GAT 240 
Lys Glu Leu Trp Gly He Cys He Phe He Ala Ala Val Asp Leu Asp 
65 70 75 

GAG ATG TCG TTC ACT TTT ACT GAG CTA CAG AAA AAC ATA GAA ATC AGT 288 
Glu Met Ser Phe Thr Phe Thr Glu Leu Gin Lys Asn He Glu He Ser 
80 85 90 

GTC CAT AAA TTC TTT AAC TTA CTA AAA GAA ATT GAT ACC AGT ACC AAA 336 
Val His Lys Phe Phe Asn Leu Leu Lys Glu He Asp Thr Ser Thr Lys 
95 100 105 110 

GTT GAT AAT GCT ATG TCA AGA CTG TTG AAG AAG TAT GAT GTA TTG TTT 384 
Val Asp Asn Ala Met Ser Arg Leu Leu Lys Lys Tyr Asp Val Leu Phe 
115 120 125 
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GCA CTC TTC AGC AAA TTG GAA AGG AC A TGT GAA CTT ATA TAT TTG ACA 432 
Ala Leu Phe Ser Lys Leu Glu Arg Thr Cys Glu Leu lie Tyr Leu Thr 
130 135 140 

CAA CCC AGC AGT TCG ATA TCT ACT GAA ATA AAT TCT GCA TTG GTG CTA 48 0 

Gin Pro Ser Ser Ser lie Ser Thr Glu lie Asn Ser Ala Leu Val Leu 
145 150 155 

AAA GTT TCT TGG ATC ACA TTT TTA TTA GCT AAA GGG GAA GTA TTA CAA 528 
Lys Val Ser Trp He Thr Phe Leu Leu Ala Lys Gly Glu Val Leu Gin 
160 165 170 

ATG GAA GAT GAT CTG GTG ATT TCA TTT CAG TTA ATG CTA TGT GTC CTT 576 
Met Glu Asp Asp Leu Val He Ser Phe Gin Leu Met Leu Cys Val Leu 
175 180 185 190 

GAC TAT TTT ATT AAA CTC TCA CCT CCC ATG TTG CTC AAA GAA CCA TAT 624 
Asp Tyr Phe He Lys Leu Ser Pro Pro Met Leu Leu Lys Glu Pro Tyr 
195 200 205 

AAA ACA GCT GTT ATA CCC ATT AAT GGT TCA CCT CGA ACA CCC AGG CGA 672 
Lys Thr Ala Val He Pro He Asn Gly Ser Pro Arg Thr Pro Arg Arg 
210 215 220 

GGT CAG AAC AGG AGT GCA CGG ATA GCA AAA CAA CTA GAA AAT GAT ACA 72 0 

Gly Gin Asn Arg Ser Ala Arg He Ala Lys Gin Leu Glu Asn Asp Thr 
225 230 235 

AGA ATT ATT GAA GTT CTC TGT AAA GAA CAT GAA TGT AAT ATA GAT GAG 768 
Arg He He Glu Val Leu Cys Lys Glu His Glu Cys Asn He Asp Glu 
240 245 250 

GTG AAA AAT GTT TAT TTC AAA AAT TTT ATA CCT TTT ATG AAT TCT CTT 816 
Val Lys Asn Val Tyr Phe Lys Asn Phe He Pro Phe Met Asn Ser Leu 
255 260 265 270 

GGA CTT GTA ACA TCT AAT GGA CTT CCA GAG GTT GAA AAT CTT TCT AAA 864 
Gly Leu Val Thr Ser Asn Gly Leu Pro Glu Val Glu Asn Leu Ser Lys 
275 280 285 

CGA TAC GAA GAA ATT TAT CTT AAA AAT AAA GAT CTA GAT GCA AGA TTA 912 
Arg Tyr Glu Glu He Tyr Leu Lys Asn Lys Asp Leu Asp Ala Arg Leu 
290 295 300 

TTT TTG GAT CAT GAT AAA ACT CTT CAG ACT GAT TCT ATA GAC AGT TTT 96 0 

Phe Leu Asp His Asp Lys Thr Leu Gin Thr Asp Ser He Asp Ser Phe 
305 310 315 

GAA ACA CAG AGA ACA CCA CGA AAA AGT AAC CTT GAT GAA GAG GTG AAT 1008 
Glu Thr Gin Arg Thr Pro Arg Lys Ser Asn Leu Asp Glu Glu Val Asn 
320 325 330 
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GTA ATT CCT CCA CAC ACT CCA GTT AGG ACT GTT ATG AAC ACT ATC CAA 1056 
Val lie Pro Pro His Thr Pro Val Arg Thr Val Met Asn Thr lie Gin 
335 340 345 350 

CAA TTA ATG ATG ATT TTA AAT TCA GCA AGT GAT CAA CCT TCA GAA AAT 1104 
Gin Leu Met Met lie Leu Asn Ser Ala Ser Asp Gin Pro Ser Glu Asn 
355 360 365 

CTG ATT TCC TAT TTT AAC AAC TGC ACA GTG AAT CCA AAA GAA AGT ATA 1152 
Leu lie Ser Tyr Phe Asn Asn Cys Thr Val Asn Pro Lys Glu Ser lie 
370 375 380 

CTG AAA AGA GTG AAG GAT ATA GGA TAC ATC TTT AAA GAG AAA TTT GCT 12 00 

Leu Lys Arg Val Lys Asp lie Gly Tyr lie Phe Lys Glu Lys Phe Ala 
385 390 395 

AAA GCT GTG GGA CAG GGT TGT GTC GAA ATT GGA TCA CAG CGA TAC AAA 1248 
Lys Ala Val Gly Gin Gly Cys Val Glu He Gly Ser Gin Arg Tyr Lys 
400 405 410 

CTT GGA GTT CGC TTG TAT TAC CGA GTA ATG GAA TCC ATG CTT AAA TCA 12 96 

Leu Gly Val Arg Leu Tyr Tyr Arg Val Met Glu Ser Met Leu Lys Ser 
415 420 425 430 

GAA GAA GAA CGA TTA TCC ATT CAA AAT TTT AGC AAA CTT CTG AAT GAC 1344 
Glu Glu Glu Arg Leu Ser He Gin Asn Phe Ser Lys Leu Leu Asn Asp 
435 440 445 

AAC ATT TTT CAT ATG TCT TTA TTG GCG TGC GCT CTT GAG GTT GTA ATG 13 92 

Asn He Phe His Met Ser Leu Leu Ala Cys Ala Leu Glu Val Val Met 
450 455 460 

GCC ACA TAT AGC AGA AGT ACA TCT CAG AAT CTT GAT TCT GGA ACA GAT 144 0 

Ala Thr Tyr Ser Arg Ser Thr Ser Gin Asn Leu Asp Ser Gly Thr Asp 
465 470 475 

TTG TCT TTC CCA TGG ATT CTG AAT GTG CTT AAT TTA AAA GCC TTT GAT 1488 
Leu Ser Phe Pro Trp He Leu Asn Val Leu Asn Leu Lys Ala Phe Asp 
480 485 490 

TTT TAC AAA GTG ATC GAA AGT TTT ATC AAA GCA GAA GGC AAC TTG ACA 1536 

Phe Tyr Lys Val He Glu Ser Phe He Lys Ala Glu Gly Asn Leu Thr 
495 500 505 510 

AGA GAA ATG ATA AAA CAT TTA GAA CGA TGT GAA CAT CGA ATC ATG GAA 1584 
Arg Glu Met He Lys His Leu Glu Arg Cys Glu His Arg He Met Glu 
515 520 525 

TCC CTT GCA TGG CTC TCA GAT TCA CCT TTA TTT GAT CTT ATT AAA CAA 1632 
Ser Leu Ala Trp Leu Ser Asp Ser Pro Leu Phe Asp Leu He Lys Gin 
530 535 540 
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TCA AAG GAC CGA GAA GGA CCA ACT GAT CAC CTT GAA TCT GCT TGT CCT 16 80 

Ser Lys Asp Arg Glu Gly Pro Thr Asp His Leu Glu Ser Ala Cys Pro 
545 550 555 

CTT AAT CTT CCT CTC CAG AAT AAT CAC ACT GCA GCA GAT ATG TAT CTT 1728 

Leu Asn Leu Pro Leu Gin Asn Asn His Thr Ala Ala Asp Met Tyr Leu 
560 565 570 

TCT CCT GTA AGA TCT CCA AAG AAA AAA GGT TCA ACT ACG CGT GTA AAT 1776 

Ser Pro Val Arg Ser Pro Lys Lys Lys Gly Ser Thr Thr Arg Val Asn 
575 580 585 590 

TCT ACT GCA AAT GCA GAG ACA CAA GCA ACC TCA GCC TTC CAG ACC CAG 1824 

Ser Thr Ala Asn Ala Glu Thr Gin Ala Thr Ser Ala Phe Gin Thr Gin 
595 600 605 

AAG CCA TTG AAA TCT ACC TCT CTT TCA CTG TTT TAT AAA AAA GTG TAT 1872 

Lys Pro Leu Lys Ser Thr Ser Leu Ser Leu Phe Tyr Lys Lys Val Tyr 

610 615 620 

CGG CTA GCC TAT CTC CGG CTA AAT ACA CTT TGT GAA CGC CTT CTG TCT 192 0 

Arg Leu Ala Tyr Leu Arg Leu Asn Thr Leu Cys Glu Arg Leu Leu Ser 
625 630 635 

GAG CAC CCA GAA TTA GAA CAT ATC ATC TGG ACC CTT TTC CAG CAC ACC 1968 

Glu His Pro Glu Leu Glu His lie He Trp Thr Leu Phe Gin His Thr 
640 645 650 

CTG CAG AAT GAG TAT GAA CTC ATG AGA GAC AGG CAT TTG GAC CAA ATT 2 016 

Leu Gin Asn Glu Tyr Glu Leu Met Arg Asp Arg His Leu Asp Gin He 
655 660 665 670 

ATG ATG TGT TCC ATG TAT GGC ATA TGC AAA GTG AAG AAT ATA GAC CTT 2064 

Met Met Cys Ser Met Tyr Gly He Cys Lys Val Lys Asn He Asp Leu 
675 680 685 

AAA TTC AAA ATC ATT GTA ACA GCA TAC AAG GAT CTT CCT CAT GCT GTT 2112 

Lys Phe Lys He He Val Thr Ala Tyr Lys Asp Leu Pro His Ala Val 

690 695 700 

CAG GAG ACA TTC AAA CGT GTT TTG ATC AAA GAA GAG GAG TAT GAT TCT 216 0 

Gin Glu Thr Phe Lys Arg Val Leu He Lys Glu Glu Glu Tyr Asp Ser 
705 710 715 

ATT ATA GTA TTC TAT AAC TCG GTC TTC ATG CAG AGA CTG AAA ACA AAT 22 08 

He He Val Phe Tyr Asn Ser Val Phe Met Gin Arg Leu Lys Thr Asn 
720 725 730 

ATT TTG CAG TAT GCT TCC ACC AGG CCC CCT ACC TTG TCA CCA ATA CCT 2256 

He Leu Gin Tyr Ala Ser Thr Arg Pro Pro Thr Leu Ser Pro He Pro 
735 740 745 750 
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CAC ATT CCT CGA AGC CCT TAC AAG TTT CCT AGT TCA CCC TTA CGG ATT 23 04 

His He Pro Arg Ser Pro Tyr Lys Phe Pro Ser Ser Pro Leu Arg He 
755 760 765 

CCT GGA GGG AAC ATC TAT ATT TCA CCC CTG AAG AGT CCA TAT AAA ATT 2352 
Pro Gly Gly Asn He Tyr He Ser Pro Leu Lys Ser Pro Tyr Lys He 
770 775 780 

TCA GAA GGT CTG CCA ACA CCA ACA AAA ATG ACT CCA AGA TCA AGA ATC 24 0 0 

Ser Glu Gly Leu Pro Thr Pro Thr Lys Met Thr Pro Arg Ser Arg He 
785 790 795 

TTA GTA TCA ATT GGT GAA TCA TTC GGG ACT TCT GAG AAG TTC CAG AAA 2448 
Leu Val Ser He Gly Glu Ser Phe Gly Thr Ser Glu Lys Phe Gin Lys 
800 805 810 

ATA AAT CAG ATG GTA TGT AAC AGC GAC CGT GTG CTC AAA AGA AGT GCT 2496 
He Asn Gin Met Val Cys Asn Ser Asp Arg Val Leu Lys Arg Ser Ala 
815 820 825 830 

GAA GGA AGC AAC CCT CCT AAA CCA CTG AAA AAA CTA CGC TTT GAT ATT 2 544 

Glu Gly Ser Asn Pro Pro Lys Pro Leu Lys Lys Leu Arg Phe Asp He 
835 840 845 

GAA GGA TCA GAT GAA GCA GAT GGA AGT AAA CAT CTC CCA GGA GAG TCC 2592 
Glu Gly Ser Asp Glu Ala Asp Gly Ser Lys His Leu Pro Gly Glu Ser 
850 855 860 

AAA TTT CAG CAG AAA CTG GCA GAA ATG ACT TCT ACT CGA ACA CGA ATG 2640 
Lys Phe Gin Gin Lys Leu Ala Glu Met Thr Ser Thr Arg Thr Arg Met 
865 870 875 

CAA AAG CAG AAA ATG AAT GAT AGC ATG GAT ACC TCA AAC AAG GAA GAG 2688 
Gin Lys Gin Lys Met Asn Asp Ser Met Asp Thr Ser Asn Lys Glu Glu 
880 885 890 

AAA TGAGGATCTC AGGACCTTGG TGGACACTGT GTACACCTCT GGATTCATTG 2 741 

Lys 

895 

TCTCTCACAG ATGTGACTGT ATAACTTTCC CAGGTTCTGT TTATGGCCAC ATTTAATATC 28 01 

TTCAGCTCTT TTTGTGGATA TAAAATGTGC AGATGCAATT GTTTGGGTGA TTCCTAAGCC 2861 

ACTTGAAATG TTAGTCATTG TTATTTATAC AAGATTGAAA ATCTTGTGTA AATCCTGCCA 2 921 

TTTAAAAAGT TGTAGCAGAT TGTTTCCTCT TCCAAAGTAA AATTGCTGTG CTTTATGGAT 2981 

AGTAAGAATG GCCCTAGAGT GGGAGTCCTG ATAACCCAGG CCTGTCTGAC TACTTTGCCT 3 041 

TCTTTTGTAG CATATAGGTG ATGTTTGCTC TTGTTTTTAT TAATTTATAT GTATATTTTT 3101 

TTAATTTAAC ATGAACACCC TTAGAAAATG TGTCCTATCT ATCTTCCAAA TGCAATTTGA 3161 
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TTGACTGCCC ATT C AC C AAA ATTATCCTGA ACTCTTCTGC AAAAATGGAT ATTATTAGAA 3221 

ATTAGAAAAA AATTACTAAT TTTACACATT AGATTTTATT TTACTATTGG AATCTGATAT 3281 

ACTGTGTGCT TGTTTTATAA AATTTTGCTT TTAATTAAAT AAAAGCTGGA AGCAAAGTAT 3341 

AACCATATGA TACTATCATA CTACTGAAAC AGATTTCATA CCTCAGAATG TAAAAGAACT 3401 

TACTGATTAT TTTCTTCATC CAACTTATGT TTTTAAATGA GGATTATTGA TAGT 3455 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 895 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

Met Gin Asp Ser Gly Pro Glu Asp Leu Pro Leu Val Arg Leu Glu Phe 
15 10 15 

Glu Glu Thr Glu Glu Pro Asp Phe Thr Ala Leu Cys Gin Lys Leu Lys 
20 25 30 

lie Pro Asp His Val Arg Glu Arg Ala Trp Leu Thr Trp Glu Lys Val 
35 40 45 

Ser Ser Val Asp Gly Val Leu Gly Gly Tyr He Gin Lys Lys Lys Glu 
50 55 60 

Leu Trp Gly He Cys He Phe He Ala Ala Val Asp Leu Asp Glu Met 
65 70 75 80 

Ser Phe Thr Phe Thr Glu Leu Gin Lys Asn He Glu He Ser Val His 
85 90 95 

Lys Phe Phe Asn Leu Leu Lys Glu He Asp Thr Ser Thr Lys Val Asp 
100 105 110 

Asn Ala Met Ser Arg Leu Leu Lys Lys Tyr Asp Val Leu Phe Ala Leu 
115 120 125 

Phe Ser Lys Leu Glu Arg Thr Cys Glu Leu He Tyr Leu Thr Gin Pro 
130 135 140 

Ser Ser Ser He Ser Thr Glu He Asn Ser Ala Leu Val Leu Lys Val 
I 45 150 155 160 

Ser Trp He Thr Phe Leu Leu Ala Lys Gly Glu Val Leu Gin Met Glu 
165 170 175 
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Asp Asp Leu Val lie Ser Phe Gin Leu Met Leu Cys Val Leu Asp Tyr 
180 185 190 

Phe lie Lys Leu Ser Pro Pro Met Leu Leu Lys Glu Pro Tyr Lys Thr 
195 200 205 

Ala Val lie Pro lie Asn Gly Ser Pro Arg Thr Pro Arg Arg Gly Gin 
210 215 220 

Asn Arg Ser Ala Arg He Ala Lys Gin Leu Glu Asn Asp Thr Arg He 
225 230 235 240 

He Glu Val Leu Cys Lys Glu His Glu Cys Asn He Asp Glu Val Lys 
245 250 255 

Asn Val Tyr Phe Lys Asn Phe He Pro Phe Met Asn Ser Leu Gly Leu 
260 265 270 

Val Thr Ser Asn Gly Leu Pro Glu Val Glu Asn Leu Ser Lys Arg Tyr 
275 280 285 

Glu Glu He Tyr Leu Lys Asn Lys Asp Leu Asp Ala Arg Leu Phe Leu 
290 295 300 

Asp His Asp Lys Thr Leu Gin Thr Asp Ser He Asp Ser Phe Glu Thr 
305 310 315 320 

Gin Arg Thr Pro Arg Lys Ser Asn Leu Asp Glu Glu Val Asn Val He 
325 330 335 

Pro Pro His Thr Pro Val Arg Thr Val Met Asn Thr He Gin Gin Leu 
340 345 350 

Met Met He Leu Asn Ser Ala Ser Asp Gin Pro Ser Glu Asn Leu He 
355 360 365 

Ser Tyr Phe Asn Asn Cys Thr Val Asn Pro Lys Glu Ser He Leu Lys 
370 375 380 

Arg Val Lys Asp He Gly Tyr He Phe Lys Glu Lys Phe Ala Lys Ala 
385 390 395 400 



Val Gly Gin Gly Cys Val Glu He 
405 

Val Arg Leu Tyr Tyr Arg Val Met 
420 

Glu Arg Leu Ser He Gin Asn Phe 
435 440 

Phe His Met Ser Leu Leu Ala Cys 
450 455 



Gly Ser Gin Arg Tyr Lys Leu Gly 
410 415 

Glu Ser Met Leu Lys Ser Glu Glu 
425 430 

Ser Lys Leu Leu Asn Asp Asn He 
445 

Ala Leu Glu Val Val Met Ala Thr 
460 
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Tyr Ser Arg Ser Thr Ser Gin Asn Leu Asp Ser Gly Thr Asp Leu Ser 
465 470 475 480 



Phe Pro Trp lie Leu Asn Val Leu Asn Leu Lys Ala Phe Asp Phe Tyr 
485 490 495 

Lys Val lie Glu Ser Phe lie Lys Ala Glu Gly Asn Leu Thr Arg Glu 
500 505 510 

Met lie Lys His Leu Glu Arg Cys Glu His Arg lie Met Glu Ser Leu 
515 520 525 

Ala Trp Leu Ser Asp Ser Pro Leu Phe Asp Leu lie Lys Gin Ser Lys 
530 535 540 

Asp Arg Glu Gly Pro Thr Asp His Leu Glu Ser Ala Cys Pro Leu Asn 
545 550 555 560 

Leu Pro Leu Gin Asn Asn His Thr Ala Ala Asp Met Tyr Leu Ser Pro 
565 570 575 

Val Arg Ser Pro Lys Lys Lys Gly Ser Thr Thr Arg Val Asn Ser Thr 
580 585 590 

Ala Asn Ala Glu Thr Gin Ala Thr Ser Ala Phe Gin Thr Gin Lys Pro 
595 600 605 

Leu Lys Ser Thr Ser Leu Ser Leu Phe Tyr Lys Lys Val Tyr Arg Leu 
610 615 620 

Ala Tyr Leu Arg Leu Asn Thr Leu Cys Glu Arg Leu Leu Ser Glu His 
625 630 635 640 

Pro Glu Leu Glu His lie lie Trp Thr Leu Phe Gin His Thr Leu Gin 
645 650 655 

Asn Glu Tyr Glu Leu Met Arg Asp Arg His Leu Asp Gin lie Met Met 
660 665 670 

Cys Ser Met Tyr Gly lie Cys Lys Val Lys Asn He Asp Leu Lys Phe 
675 680 685 

Lys He He Val Thr Ala Tyr Lys Asp Leu Pro His Ala Val Gin Glu 
690 695 700 

Thr Phe Lys Arg Val Leu He Lys Glu Glu Glu Tyr Asp Ser He He 
705 710 715 720 

Val Phe Tyr Asn Ser Val Phe Met Gin Arg Leu Lys Thr Asn He Leu 
725 730 735 

Gin Tyr Ala Ser Thr Arg Pro Pro Thr Leu Ser Pro He Pro His He 
740 745 750 
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Pro Arg Ser Pro 
755 

Gly Asn lie Tyr 
770 

Gly Leu Pro Thr 
785 

Ser lie Gly Glu 



Gin Met Val Cys 
820 

Ser Asn Pro Pro 
835 

Ser Asp Glu Ala 
850 

Gin Gin Lys Leu 
865 

Gin Lys Met Asn 



Tyr Lys Phe Pro 
760 

lie Ser Pro Leu 
775 

Pro Thr Lys Met 
790 

Ser Phe Gly Thr 
805 

Asn Ser Asp Arg 



Lys Pro Leu Lys 
840 

Asp Gly Ser Lys 
855 

Ala Glu Met Thr 
870 

Asp Ser Met Asp 
885 



Ser Ser Pro Leu 



Lys Ser Pro Tyr 
780 

Thr Pro Arg Ser 
795 

Ser Glu Lys Phe 
810 

Val Leu Lys Arg 
825 

Lys Leu Arg Phe 



His Leu Pro Gly 
860 

Ser Thr Arg Thr 
875 

Thr Ser Asn Lys 
890 



Arg lie Pro Gly 
765 

Lys lie Ser Glu 



Arg He Leu Val 
800 

Gin Lys He Asn 
815 

Ser Ala Glu Gly 
830 

Asp He Glu Gly 
845 

Glu Ser Lys Phe 



Arg Met Gin Lys 
880 

Glu Glu Lys 
895 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 92 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME / KEY : CDS 

(B) LOCATION: 7.. 2628 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

GCCATC ATG GAT TTT ACT GCA TTA TGT CAG AAA TTA AAG ATA CCA GAT 48 
Met Asp Phe Thr Ala Leu Cys Gin Lys Leu Lys He Pro Asp 
15 10 

CAT GTC AGA GAG AGA GCT TGG TTA ACT TGG GAG AAA GTT TCA TCT GTG 96 
His Val Arg Glu Arg Ala Trp Leu Thr Trp Glu Lys Val Ser Ser Val 
15 20 25 30 

GAT GGA GTA TTG GGA GGT TAT ATT CAA AAG AAA AAG GAA CTG TGG GGA 144 
Asp Gly Val Leu Gly Gly Tyr He Gin Lys Lys Lys Glu Leu Trp Gly 
35 40 45 
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ATC TGT ATC TTT ATT GCA GCA GTT GAC CTA GAT GAG ATG TCG TTC ACT 192 
lie Cys He Phe He Ala Ala Val Asp Leu Asp Glu Met Ser Phe Thr 
50 55 60 

TTT ACT GAG CTA CAG AAA AAC ATA GAA ATC AGT GTC CAT AAA TTC TTT 24 0 

Phe Thr Glu Leu Gin Lys Asn He Glu He Ser Val His Lys Phe Phe 
65 70 75 

AAC TTA CTA AAA GAA ATT GAT ACC AGT ACC AAA GTT GAT AAT GCT ATG 288 
Asn Leu Leu Lys Glu He Asp Thr Ser Thr Lys Val Asp Asn Ala Met 
80 85 90 

TCA AGA CTG TTG AAG AAG TAT GAT GTA TTG TTT GCA CTC TTC AGC AAA 33 6 

Ser Arg Leu Leu Lys Lys Tyr Asp Val Leu Phe Ala Leu Phe Ser Lys 
95 100 105 110 

TTG GAA AGG ACA TGT GAA CTT ATA TAT TTG ACA CAA CCC AGC AGT TCG 384 
Leu Glu Arg Thr Cys Glu Leu He Tyr Leu Thr Gin Pro Ser Ser Ser 
115 120 125 

ATA TCT ACT GAA ATA AAT TCT GCA TTG GTG CTA AAA GTT TCT TGG ATC 432 
He Ser Thr Glu He Asn Ser Ala Leu Val Leu Lys Val Ser Trp He 
130 135 140 

ACA TTT TTA TTA GCT AAA GGG GAA GTA TTA CAA ATG GAA GAT GAT CTG 48 0 

Thr Phe Leu Leu Ala Lys Gly Glu Val Leu Gin Met Glu Asp Asp Leu 
145 150 155 

GTG ATT TCA TTT CAG TTA ATG CTA TGT GTC CTT GAC TAT TTT ATT AAA 528 
Val He Ser Phe Gin Leu Met Leu Cys Val Leu Asp Tyr Phe He Lys 
160 165 170 

CTC TCA CCT CCC ATG TTG CTC AAA GAA CCA TAT AAA ACA GCT GTT ATA 5 76 

Leu Ser Pro Pro Met Leu Leu Lys Glu Pro Tyr Lys Thr Ala Val He 
175 180 185 190 

CCC ATT AAT GGT TCA CCT CGA ACA CCC AGG CGA GGT CAG AAC AGG AGT 624 
Pro He Asn Gly Ser Pro Arg Thr Pro Arg Arg Gly Gin Asn Arg Ser 
195 200 205 

GCA CGG ATA GCA AAA CAA CTA GAA AAT GAT ACA AGA ATT ATT GAA GTT 672 
Ala Arg He Ala Lys Gin Leu Glu Asn Asp Thr Arg He He Glu Val 
210 215 220 

CTC TGT AAA GAA CAT GAA TGT AAT ATA GAT GAG GTG AAA AAT GTT TAT 72 0 

Leu Cys Lys Glu His Glu Cys Asn He Asp Glu Val Lys Asn Val Tyr 
225 230 235 

TTC AAA AAT TTT ATA CCT TTT ATG AAT TCT CTT GGA CTT GTA ACA TCT 76 8 

Phe Lys Asn Phe He Pro Phe Met Asn Ser Leu Gly Leu Val Thr Ser 
240 245 250 
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AAT GGA CTT CCA GAG GTT GAA AAT CTT TCT AAA CGA TAC GAA GAA ATT 816 

Asn Gly Leu Pro Glu Val Glu Asn Leu Ser Lys Arg Tyr Glu Glu lie 
255 260 265 270 

TAT CTT AAA AAT AAA GAT CTA GAT GCA AGA TTA TTT TTG GAT CAT GAT 864 

Tyr Leu Lys Asn Lys Asp Leu Asp Ala Arg Leu Phe Leu Asp His Asp 

275 280 285 

AAA ACT CTT CAG ACT GAT TCT ATA GAC AGT TTT GAA ACA CAG AGA ACA 912 

Lys Thr Leu Gin Thr Asp Ser lie Asp Ser Phe Glu Thr Gin Arg Thr 
290 295 300 

CCA CGA AAA AGT AAC CTT GAT GAA GAG GTG AAT GTA ATT CCT CCA CAC 960 

Pro Arg Lys Ser Asn Leu Asp Glu Glu Val Asn Val lie Pro Pro His 
305 310 315 

ACT CCA GTT AGG ACT GTT ATG AAC ACT ATC CAA CAA TTA ATG ATG ATT 1008 

Thr Pro Val Arg Thr Val Met Asn Thr lie Gin Gin Leu Met Met lie 
320 325 330 

TTA AAT TCA GCA AGT GAT CAA CCT TCA GAA AAT CTG ATT TCC TAT TTT 1056 

Leu Asn Ser Ala Ser Asp Gin Pro Ser Glu Asn Leu lie Ser Tyr Phe 
335 340 345 350 

AAC AAC TGC ACA GTG AAT CCA AAA GAA AGT ATA CTG AAA AGA GTG AAG 1104 

Asn Asn Cys Thr Val Asn Pro Lys Glu Ser lie Leu Lys Arg Val Lys 

355 360 365 

GAT ATA GGA TAC ATC TTT AAA GAG AAA TTT GCT AAA GCT GTG GGA CAG 1152 

Asp lie Gly Tyr lie Phe Lys Glu Lys Phe Ala Lys Ala Val Gly Gin 
370 375 380 

GGT TGT GTC GAA ATT GGA TCA CAG CGA TAC AAA CTT GGA GTT CGC TTG 12 00 

Gly Cys Val Glu lie Gly Ser Gin Arg Tyr Lys Leu Gly Val Arg Leu 
385 390 395 

TAT TAC CGA GTA ATG GAA TCC ATG CTT AAA TCA GAA GAA GAA CGA TTA 124 8 

Tyr Tyr Arg Val Met Glu Ser Met Leu Lys Ser Glu Glu Glu Arg Leu 
400 405 410 

TCC ATT CAA AAT TTT AGC AAA CTT CTG AAT GAC AAC ATT TTT CAT ATG 12 96 

Ser lie Gin Asn Phe Ser Lys Leu Leu Asn Asp Asn lie Phe His Met 
415 420 425 430 

TCT TTA TTG GCG TGC GCT CTT GAG GTT GTA ATG GCC ACA TAT AGC AGA 1344 

Ser Leu Leu Ala Cys Ala Leu Glu Val Val Met Ala Thr Tyr Ser Arg 

435 440 445 

AGT ACA TCT CAG AAT CTT GAT TCT GGA ACA GAT TTG TCT TTC CCA TGG 1392 

Ser Thr Ser Gin Asn Leu Asp Ser Gly Thr Asp Leu Ser Phe Pro Trp 
450 455 460 
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ATT CTG AAT GTG CTT AAT TTA AAA GCC TTT GAT TTT TAC AAA GTG ATC 144 0 

lie Leu Asn Val Leu Asn Leu Lys Ala Phe Asp Phe Tyr Lys Val lie 
465 470 475 

GAA AGT TTT ATC AAA GCA GAA GGC AAC TTG ACA AGA GAA ATG ATA AAA 1488 
Glu Ser Phe lie Lys Ala Glu Gly Asn Leu Thr Arg Glu Met lie Lys 
480 485 490 

CAT TTA GAA CGA TGT GAA CAT CGA ATC ATG GAA TCC CTT GCA TGG CTC 153 6 

His Leu Glu Arg Cys Glu His Arg lie Met Glu Ser Leu Ala Trp Leu 
495 500 505 510 

TCA GAT TCA CCT TTA TTT GAT CTT ATT AAA CAA TCA AAG GAC CGA GAA 1584 
Ser Asp Ser Pro Leu Phe Asp Leu lie Lys Gin Ser Lys Asp Arg Glu 
515 520 525 

GGA CCA ACT GAT CAC CTT GAA TCT GCT TGT CCT CTT AAT CTT CCT CTC 1632 
Gly Pro Thr Asp His Leu Glu Ser Ala Cys Pro Leu Asn Leu Pro Leu 
530 535 540 

CAG AAT AAT CAC ACT GCA GCA GAT ATG TAT CTT TCT CCT GTA AGA TCT 168 0 

Gin Asn Asn His Thr Ala Ala Asp Met Tyr Leu Ser Pro Val Arg Ser 
545 550 555 

CCA AAG AAA AAA GGT TCA ACT ACG CGT GTA AAT TCT ACT GCA AAT GCA 172 8 

Pro Lys Lys Lys Gly Ser Thr Thr Arg Val Asn Ser Thr Ala Asn Ala 
560 565 570 

GAG ACA CAA GCA ACC TCA GCC TTC CAG ACC CAG AAG CCA TTG AAA TCT 1776 
Glu Thr Gin Ala Thr Ser Ala Phe Gin Thr Gin Lys Pro Leu Lys Ser 
575 580 585 590 

ACC TCT CTT TCA CTG TTT TAT AAA AAA GTG TAT CGG CTA GCC TAT CTC 1824 
Thr Ser Leu Ser Leu Phe Tyr Lys Lys Val Tyr Arg Leu Ala Tyr Leu 
595 600 605 

CGG CTA AAT ACA CTT TGT GAA CGC CTT CTG TCT GAG CAC CCA GAA TTA 1872 
Arg Leu Asn Thr Leu Cys Glu Arg Leu Leu Ser Glu His Pro Glu Leu 
610 615 620 

GAA CAT ATC ATC TGG ACC CTT TTC CAG CAC ACC CTG CAG AAT GAG TAT 1920 
Glu His lie lie Trp Thr Leu Phe Gin His Thr Leu Gin Asn Glu Tyr 
625 630 635 

GAA CTC ATG AGA GAC AGG CAT TTG GAC CAA ATT ATG ATG TGT TCC ATG 1968 
Glu Leu Met Arg Asp Arg His Leu Asp Gin lie Met Met Cys Ser Met 
640 645 650 

TAT GGC ATA TGC AAA GTG AAG AAT ATA GAC CTT AAA TTC AAA ATC ATT 2016 
Tyr Gly lie Cys Lys Val Lys Asn lie Asp Leu Lys Phe Lys lie lie 
655 660 665 670 
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GTA ACA GCA TAC AAG GAT CTT CCT CAT GCT GTT CAG GAG ACA TTC AAA 2064 
Val Thr Ala Tyr Lys Asp Leu Pro His Ala Val Gin Glu Thr Phe Lys 
675 680 685 

CGT GTT TTG ATC AAA GAA GAG GAG TAT GAT TCT ATT ATA GTA TTC TAT 2112 
Arg Val Leu lie Lys Glu Glu Glu Tyr Asp Ser lie lie Val Phe Tyr 
690 695 700 

AAC TCG GTC TTC ATG CAG AGA CTG AAA ACA AAT ATT TTG CAG TAT GCT 216 0 

Asn Ser Val Phe Met Gin Arg Leu Lys Thr Asn lie Leu Gin Tyr Ala 
705 710 715 

TCC ACC AGG CCC CCT ACC TTG TCA CCA ATA CCT CAC ATT CCT CGA AGC 22 08 

Ser Thr Arg Pro Pro Thr Leu Ser Pro lie Pro His lie Pro Arg Ser 
720 725 730 

CCT TAC AAG TTT CCT AGT TCA CCC TTA CGG ATT CCT GGA GGG AAC ATC 2256 
Pro Tyr Lys Phe Pro Ser Ser Pro Leu Arg lie Pro Gly Gly Asn lie 
735 740 745 750 

TAT ATT TCA CCC CTG AAG AGT CCA TAT AAA ATT TCA GAA GGT CTG CCA 23 04 

Tyr lie Ser Pro Leu Lys Ser Pro Tyr Lys lie Ser Glu Gly Leu Pro 
755 760 765 

ACA CCA ACA AAA ATG ACT CCA AGA TCA AGA ATC TTA GTA TCA ATT GGT 2 352 

Thr Pro Thr Lys Met Thr Pro Arg Ser Arg lie Leu Val Ser lie Gly 
770 775 780 

GAA TCA TTC GGG ACT TCT GAG AAG TTC CAG AAA ATA AAT CAG ATG GTA 2400 
Glu Ser Phe Gly Thr Ser Glu Lys Phe Gin Lys lie Asn Gin Met Val 
785 790 795 

TGT AAC AGC GAC CGT GTG CTC AAA AGA AGT GCT GAA GGA AGC AAC CCT 2448 
Cys Asn Ser Asp Arg Val Leu Lys Arg Ser Ala Glu Gly Ser Asn Pro 
800 805 810 

CCT AAA CCA CTG AAA AAA CTA CGC TTT GAT ATT GAA GGA TCA GAT GAA 2496 
Pro Lys Pro Leu Lys Lys Leu Arg Phe Asp lie Glu Gly Ser Asp Glu 
815 820 825 830 

GCA GAT GGA AGT AAA CAT CTC CCA GGA GAG TCC AAA TTT CAG CAG AAA 2544 
Ala Asp Gly Ser Lys His Leu Pro Gly Glu Ser Lys Phe Gin Gin Lys 
835 840 845 

CTG GCA GAA ATG ACT TCT ACT CGA ACA CGA ATG CAA AAG CAG AAA ATG 2 592 

Leu Ala Glu Met Thr Ser Thr Arg Thr Arg Met Gin Lys Gin Lys Met 
850 855 860 

AAT GAT AGC ATG GAT ACC TCA AAC AAG GAA GAG AAA TGAGGATCTC 2 638 

Asn Asp Ser Met Asp Thr Ser Asn Lys Glu Glu Lys 
865 870 

AGGACCTTGG TGGACACTGT GTACACCTCT GGATTCATTG TCTCTCACAG ATGTGACTGT 2698 
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ATAACTTTCC CAGGTTCTGT TTATGGCCAC ATTTAATATC TTCAGCTCTT TTTGTGGATA 2 758 

TAAAATGTGC AGATGCAATT GTTTGGGTGA TTCCTAAGCC ACTTGAAATG TTAGTCATTG 2818 

TTATTTATAC AAGATTGAAA ATCTTGTGTA AATCCTGCCA TTTAAAAAGT TGTAGCAGAT 287 8 

TGTTTCCTCT TCCAAAGTAA AATTGCTGTG CTTTATGGAT AGTAAGAATG GCCCTAGAGT 2 93 8 

GGGAGTCCTG ATAACCCAGG CCTGTCTGAC TACTTTGCCT TCTTTTGTAG CATATAGGTG 2998 

ATGTTTGCTC TTGTTTTTAT TAATTTATAT GTATATTTTT TTAATTTAAC ATGAACACCC 3 058 

TTAGAAAATG TGTCCTATCT ATCTTCCAAA TGCAATTTGA TTGACTGCCC ATTCACCAAA 3118 

ATTATCCTGA ACTCTTCTGC AAAAATGGAT ATTATTAGAA ATTAGAAAAA AATTACTAAT 317 8 

TTTACACATT AGATTTTATT TTACTATTGG AATCTGATAT ACTGTGTGCT TGTTTTATAA 3238 

AATTTTGCTT TTAATTAAAT AAAAGCTGGA AGCAAAGTAT AAC CATATGA TACTATCATA 3298 

CTACTGAAAC AGATTTCATA CCTCAGAATG TAAAAGAACT TACTGATTAT TTTCTTCATC 335 8 

CAACTTATGT TTTTAAATGA GGATTATTGA TAGT 3392 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 874 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

Met Asp Phe Thr Ala Leu Cys Gin Lys Leu Lys lie Pro Asp His Val 
15 10 15 

Arg Glu Arg Ala Trp Leu Thr Trp Glu Lys Val Ser Ser Val Asp Gly 
20 25 30 

Val Leu Gly Gly Tyr lie Gin Lys Lys Lys Glu Leu Trp Gly lie Cys 
35 40 45 

lie Phe lie Ala Ala Val Asp Leu Asp Glu Met Ser Phe Thr Phe Thr 
50 55 60 

Glu Leu Gin Lys Asn lie Glu lie Ser Val His Lys Phe Phe Asn Leu 
65 70 75 80 

Leu Lys Glu He Asp Thr Ser Thr Lys Val Asp Asn Ala Met Ser Arg 
85 90 95 
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Leu Leu Lys Lys Tyr Asp Val Leu Phe Ala Leu Phe Ser Lys Leu Glu 
100 105 110 



Arg Thr Cys Glu Leu lie Tyr Leu Thr Gin Pro Ser Ser Ser lie Ser 
115 120 125 

Thr Glu lie Asn Ser Ala Leu Val Leu Lys Val Ser Trp lie Thr Phe 
130 135 140 

Leu Leu Ala Lys Gly Glu Val Leu Gin Met Glu Asp Asp Leu Val lie 
145 150 155 160 

Ser Phe Gin Leu Met Leu Cys Val Leu Asp Tyr Phe lie Lys Leu Ser 
165 170 175 

Pro Pro Met Leu Leu Lys Glu Pro Tyr Lys Thr Ala Val lie Pro lie 
180 185 190 

Asn Gly Ser Pro Arg Thr Pro Arg Arg Gly Gin Asn Arg Ser Ala Arg 
195 200 205 

lie Ala Lys Gin Leu Glu Asn Asp Thr Arg lie lie Glu Val Leu Cys 
210 215 220 

Lys Glu His Glu Cys Asn lie Asp Glu Val Lys Asn Val Tyr Phe Lys 
225 230 235 240 

Asn Phe lie Pro Phe Met Asn Ser Leu Gly Leu Val Thr Ser Asn Gly 
245 250 255 

Leu Pro Glu Val Glu Asn Leu Ser Lys Arg Tyr Glu Glu lie Tyr Leu 
260 265 270 

Lys Asn Lys Asp Leu Asp Ala Arg Leu Phe Leu Asp His Asp Lys Thr 
275 280 285 

Leu Gin Thr Asp Ser lie Asp Ser Phe Glu Thr Gin Arg Thr Pro Arg 
290 295 300 

Lys Ser Asn Leu Asp Glu Glu Val Asn Val lie Pro Pro His Thr Pro 
305 310 315 320 

Val Arg Thr Val Met Asn Thr lie Gin Gin Leu Met Met lie Leu Asn 
325 330 335 

Ser Ala Ser Asp Gin Pro Ser Glu Asn Leu lie Ser Tyr Phe Asn Asn 
340 345 350 

Cys Thr Val Asn Pro Lys Glu Ser lie Leu Lys Arg Val Lys Asp lie 
355 360 365 

Gly Tyr lie Phe Lys Glu Lys Phe Ala Lys Ala Val Gly Gin Gly Cys 
370 375 380 
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Val Glu He Gly 
385 

Arg Val Met Glu 



Gin Asn Phe Ser 
420 

Leu Ala Cys Ala 
435 

Ser Gin Asn Leu 
450 



Ser Gin Arg Tyr 
390 

Ser Met Leu Lys 
405 

Lys Leu Leu Asn 



Leu Glu Val Val 
440 

Asp Ser Gly Thr 
455 



Lys Leu Gly Val 
395 

Ser Glu Glu Glu 
410 

Asp Asn He Phe 
425 

Met Ala Thr Tyr 



Asp Leu Ser Phe 
460 



Arg Leu Tyr Tyr 
400 

Arg Leu Ser He 
415 

His Met Ser Leu 
430 

Ser Arg Ser Thr 
445 

Pro Trp He Leu 



Asn Val Leu Asn 
465 

Phe He Lys Ala 



Glu Arg Cys Glu 
500 

Ser Pro Leu Phe 
515 

Thr Asp His Leu 
530 

Asn His Thr Ala 
545 

Lys Lys Gly Ser 

Gin Ala Thr Ser 
580 



Leu Lys Ala Phe 
470 

Glu Gly Asn Leu 
485 

His Arg He Met 



Asp Leu He Lys 
520 

Glu Ser Ala Cys 
535 

Ala Asp Met Tyr 
550 

Thr Thr Arg Val 
565 

Ala Phe Gin Thr 



Asp Phe Tyr Lys 
475 

Thr Arg Glu Met 
490 

Glu Ser Leu Ala 
505 

Gin Ser Lys Asp 



Pro Leu Asn Leu 
540 

Leu Ser Pro Val 
555 

Asn Ser Thr Ala 
570 

Gin Lys Pro Leu 
585 



Val He Glu Ser 
480 

He Lys His Leu 
495 

Trp Leu Ser Asp 
510 

Arg Glu Gly Pro 
525 

Pro Leu Gin Asn 



Arg Ser Pro Lys 
560 

Asn Ala Glu Thr 
575 

Lys Ser Thr Ser 
590 



Leu Ser Leu Phe 
595 

Asn Thr Leu Cys 
610 

He He Trp Thr 
625 

Met Arg Asp Arg 



He Cys Lys Val 
660 



Tyr Lys Lys Val 
600 

Glu Arg Leu Leu 
615 

Leu Phe Gin His 
630 

His Leu Asp Gin 
645 

Lys Asn He Asp 



Tyr Arg Leu Ala 



Ser Glu His Pro 
620 

Thr Leu Gin Asn 
635 

He Met Met Cys 
650 

Leu Lys Phe Lys 
665 



Tyr Leu Arg Leu 
605 

Glu Leu Glu His 



Glu Tyr Glu Leu 
640 

Ser Met Tyr Gly 
655 

He He Val Thr 
670 
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Ala Tyr Lys Asp Leu Pro His Ala Val Gin Glu Thr Phe Lys Arg Val 
675 680 685 



Leu lie Lys Glu Glu 
690 

Val Phe Met Gin Arg 
705 

Arg Pro Pro Thr Leu 
725 

Lys Phe Pro Ser Ser 
740 

Ser Pro Leu Lys Ser 
755 

Thr Lys Met Thr Pro 
770 

Phe Gly Thr Ser Glu 
785 

Ser Asp Arg Val Leu 
805 

Pro Leu Lys Lys Leu 
820 

Gly Ser Lys His Leu 
835 

Glu Met Thr Ser Thr 
850 

Ser Met Asp Thr Ser 
865 



Glu Tyr Asp Ser lie lie 
695 

Leu Lys Thr Asn He Leu 

710 715 

Ser Pro He Pro His He 
730 

Pro Leu Arg He Pro Gly 
745 

Pro Tyr Lys He Ser Glu 
760 

Arg Ser Arg He Leu Val 
775 

Lys Phe Gin Lys He Asn 
790 795 

Lys Arg Ser Ala Glu Gly 
810 

Arg Phe Asp He Glu Gly 
825 

Pro Gly Glu Ser Lys Phe 
840 

Arg Thr Arg Met Gin Lys 
855 

Asn Lys Glu Glu Lys 
870 



Val Phe Tyr Asn Ser 
700 

Gin Tyr Ala Ser Thr 
720 

Pro Arg Ser Pro Tyr 
735 

Gly Asn He Tyr He 
750 

Gly Leu Pro Thr Pro 
765 

Ser He Gly Glu Ser 
780 

Gin Met Val Cys Asn 
800 

Ser Asn Pro Pro Lys 
815 

Ser Asp Glu Ala Asp 
830 

Gin Gin Lys Leu Ala 
845 

Gin Lys Met Asn Asp 
860 



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3323 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 7.. 2559 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 
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GCCATC ATG GAG AAA GTT TCA TCT GTG GAT GGA GTA TTG GGA GGT TAT 48 
Met Glu Lys Val Ser Ser Val Asp Gly Val Leu Gly Gly Tyr 
15 10 

ATT CAA AAG AAA AAG GAA CTG TGG GGA ATC TGT ATC TTT ATT GCA GCA 96 
He Gin Lys Lys Lys Glu Leu Trp Gly He Cys He Phe He Ala Ala 
15 20 25 30 

GTT GAC CTA GAT GAG ATG TCG TTC ACT TTT ACT GAG CTA CAG AAA AAC 144 
Val Asp Leu Asp Glu Met Ser Phe Thr Phe Thr Glu Leu Gin Lys Asn 
35 40 45 

ATA GAA ATC AGT GTC CAT AAA TTC TTT AAC TTA CTA AAA GAA ATT GAT 192 
He Glu He Ser Val His Lys Phe Phe Asn Leu Leu Lys Glu He Asp 
50 55 60 

ACC AGT ACC AAA GTT GAT AAT GCT ATG TCA AGA CTG TTG AAG AAG TAT 24 0 

Thr Ser Thr Lys Val Asp Asn Ala Met Ser Arg Leu Leu Lys Lys Tyr 
65 70 75 



GAT GTA TTG TTT GCA CTC TTC AGC AAA TTG GAA AGG ACA TGT GAA CTT 
Asp Val Leu Phe Ala Leu Phe Ser Lys Leu Glu Arg Thr Cys Glu Leu 
80 85 90 



CTA TGT GTC CTT GAC TAT TTT ATT AAA CTC TCA CCT CCC ATG TTG CTC 
Leu Cys Val Leu Asp Tyr Phe He Lys Leu Ser Pro Pro Met Leu Leu 
145 150 155 



288 



ATA TAT TTG ACA CAA CCC AGC AGT TCG ATA TCT ACT GAA ATA AAT TCT 3 36 

He Tyr Leu Thr Gin Pro Ser Ser Ser He Ser Thr Glu He Asn Ser 
95 100 105 110 

GCA TTG GTG CTA AAA GTT TCT TGG ATC ACA TTT TTA TTA GCT AAA GGG 3 84 

Ala Leu Val Leu Lys Val Ser Trp He Thr Phe Leu Leu Ala Lys Gly 
H5 120 125 

GAA GTA TTA CAA ATG GAA GAT GAT CTG GTG ATT TCA TTT CAG TTA ATG 432 
Glu Val Leu Gin Met Glu Asp Asp Leu Val He Ser Phe Gin Leu Met 
130 135 140 



480 



AAA GAA CCA TAT AAA ACA GCT GTT ATA CCC ATT AAT GGT TCA CCT CGA 528 
Lys Glu Pro Tyr Lys Thr Ala Val He Pro He Asn Gly Ser Pro Arg 
160 165 170 

ACA CCC AGG CGA GGT CAG AAC AGG AGT GCA CGG ATA GCA AAA CAA CTA 576 
Thr Pro Arg Arg Gly Gin Asn Arg Ser Ala Arg He Ala Lys Gin Leu 
275 180 185 190 

GAA AAT GAT ACA AGA ATT ATT GAA GTT CTC TGT AAA GAA CAT GAA TGT 624 
Glu Asn Asp Thr Arg He He Glu Val Leu Cys Lys Glu His Glu Cys 
195 200 205 
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AAT ATA GAT GAG GTG AAA AAT GTT TAT TTC AAA AAT TTT ATA CCT TTT 672 
Asn lie Asp Glu Val Lys Asn Val Tyr Phe Lys Asn Phe lie Pro Phe 
210 215 220 

ATG AAT TCT CTT GGA CTT GTA ACA TCT AAT GGA CTT CCA GAG GTT GAA 72 0 

Met Asn Ser Leu Gly Leu Val Thr Ser Asn Gly Leu Pro Glu Val Glu 
225 230 235 

AAT CTT TCT AAA CGA TAC GAA GAA ATT TAT CTT AAA AAT AAA GAT CTA 768 
Asn Leu Ser Lys Arg Tyr Glu Glu lie Tyr Leu Lys Asn Lys Asp Leu 
240 245 250 

GAT GCA AGA TTA TTT TTG GAT CAT GAT AAA ACT CTT CAG ACT GAT TCT 816 
Asp Ala Arg Leu Phe Leu Asp His Asp Lys Thr Leu Gin Thr Asp Ser 
255 260 265 270 

ATA GAC AGT TTT GAA ACA CAG AGA ACA CCA CGA AAA AGT AAC CTT GAT 864 
lie Asp Ser Phe Glu Thr Gin Arg Thr Pro Arg Lys Ser Asn Leu Asp 
275 280 285 

GAA GAG GTG AAT GTA ATT CCT CCA CAC ACT CCA GTT AGG ACT GTT ATG 912 
Glu Glu Val Asn Val He Pro Pro His Thr Pro Val Arg Thr Val Met 
290 295 300 

AAC ACT ATC CAA CAA TTA ATG ATG ATT TTA AAT TCA GCA AGT GAT CAA 960 
Asn Thr He Gin Gin Leu Met Met He Leu Asn Ser Ala Ser Asp Gin 
305 310 315 

CCT TCA GAA AAT CTG ATT TCC TAT TTT AAC AAC TGC ACA GTG AAT CCA 1008 
Pro Ser Glu Asn Leu He Ser Tyr Phe Asn Asn Cys Thr Val Asn Pro 
320 325 330 

AAA GAA AGT ATA CTG AAA AGA GTG AAG GAT ATA GGA TAC ATC TTT AAA 1056 
Lys Glu Ser He Leu Lys Arg Val Lys Asp He Gly Tyr He Phe Lys 
335 340 345 350 

GAG AAA TTT GCT AAA GCT GTG GGA CAG GGT TGT GTC GAA ATT GGA TCA 1104 
Glu Lys Phe Ala Lys Ala Val Gly Gin Gly Cys Val Glu He Gly Ser 
355 360 365 

CAG CGA TAC AAA CTT GGA GTT CGC TTG TAT TAC CGA GTA ATG GAA TCC 1152 
Gin Arg Tyr Lys Leu Gly Val Arg Leu Tyr Tyr Arg Val Met Glu Ser 
370 375 380 

ATG CTT AAA TCA GAA GAA GAA CGA TTA TCC ATT CAA AAT TTT AGC AAA 12 00 

Met Leu Lys Ser Glu Glu Glu Arg Leu Ser He Gin Asn Phe Ser Lys 
385 390 395 

CTT CTG AAT GAC AAC ATT TTT CAT ATG TCT TTA TTG GCG TGC GCT CTT 1248 
Leu Leu Asn Asp Asn He Phe His Met Ser Leu Leu Ala Cys Ala Leu 
400 405 410 
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GAG GTT GTA ATG GCC ACA TAT AGC AGA AGT ACA TCT CAG AAT CTT GAT 12 96 

Glu Val Val Met Ala Thr Tyr Ser Arg Ser Thr Ser Gin Asn Leu Asp 
415 420 425 430 

TCT GGA ACA GAT TTG TCT TTC CCA TGG ATT CTG AAT GTG CTT AAT TTA 1344 
Ser Gly Thr Asp Leu Ser Phe Pro Trp He Leu Asn Val Leu Asn Leu 
435 440 445 

AAA GCC TTT GAT TTT TAC AAA GTG ATC GAA AGT TTT ATC AAA GCA GAA 13 92 

Lys Ala Phe Asp Phe Tyr Lys Val He Glu Ser Phe He Lys Ala Glu 
450 455 460 

GGC AAC TTG ACA AGA GAA ATG ATA AAA CAT TTA GAA CGA TGT GAA CAT 1440 
Gly Asn Leu Thr Arg Glu Met He Lys His Leu Glu Arg Cys Glu His 
465 470 475 

CGA ATC ATG GAA TCC CTT GCA TGG CTC TCA GAT TCA CCT TTA TTT GAT 1488 
Arg He Met Glu Ser Leu Ala Trp Leu Ser Asp Ser Pro Leu Phe Asp 
480 485 490 

CTT ATT AAA CAA TCA AAG GAC CGA GAA GGA CCA ACT GAT CAC CTT GAA 1536 
Leu He Lys Gin Ser Lys Asp Arg Glu Gly Pro Thr Asp His Leu Glu 
495 500 505 510 

TCT GCT TGT CCT CTT AAT CTT CCT CTC CAG AAT AAT CAC ACT GCA GCA 1584 
Ser Ala Cys Pro Leu Asn Leu Pro Leu Gin Asn Asn His Thr Ala Ala 
515 520 525 

GAT ATG TAT CTT TCT CCT GTA AGA TCT CCA AAG AAA AAA GGT TCA ACT 1632 
Asp Met Tyr Leu Ser Pro Val Arg Ser Pro Lys Lys Lys Gly Ser Thr 
530 535 540 

ACG CGT GTA AAT TCT ACT GCA AAT GCA GAG ACA CAA GCA ACC TCA GCC 1680 
Thr Arg Val Asn Ser Thr Ala Asn Ala Glu Thr Gin Ala Thr Ser Ala 
545 550 555 

TTC CAG ACC CAG AAG CCA TTG AAA TCT ACC TCT CTT TCA CTG TTT TAT 1728 
Phe Gin Thr Gin Lys Pro Leu Lys Ser Thr Ser Leu Ser Leu Phe Tyr 
560 565 570 

AAA AAA GTG TAT CGG CTA GCC TAT CTC CGG CTA AAT ACA CTT TGT GAA 1776 
Lys Lys Val Tyr Arg Leu Ala Tyr Leu Arg Leu Asn Thr Leu Cys Glu 
575 580 585 590 

CGC CTT CTG TCT GAG CAC CCA GAA TTA GAA CAT ATC ATC TGG ACC CTT 1824 
Arg Leu Leu Ser Glu His Pro Glu Leu Glu His He He Trp Thr Leu 
595 600 605 

TTC CAG CAC ACC CTG CAG AAT GAG TAT GAA CTC ATG AGA GAC AGG CAT 1872 
Phe Gin His Thr Leu Gin Asn Glu Tyr Glu Leu Met Arg Asp Arg His 
610 615 620 
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TTG GAC CAA ATT ATG ATG TGT TCC ATG TAT GGC ATA TGC AAA GTG AAG 192 0 

Leu Asp Gin lie Met Met Cys Ser Met Tyr Gly lie Cys Lys Val Lys 
625 630 635 

AAT ATA GAC CTT AAA TTC AAA ATC ATT GTA ACA GCA TAG AAG GAT CTT 1968 
Asn lie Asp Leu Lys Phe Lys lie lie Val Thr Ala Tyr Lys Asp Leu 
640 645 650 

CCT CAT GCT GTT CAG GAG ACA TTC AAA CGT GTT TTG ATC AAA GAA GAG 2 016 

Pro His Ala Val Gin Glu Thr Phe Lys Arg Val Leu lie Lys Glu Glu 
655 660 665 670 

GAG TAT GAT TCT ATT ATA GTA TTC TAT AAC TCG GTC TTC ATG CAG AGA 2064 
Glu Tyr Asp Ser lie lie Val Phe Tyr Asn Ser Val Phe Met Gin Arg 
675 680 685 

CTG AAA ACA AAT ATT TTG CAG TAT GCT TCC ACC AGG CCC CCT ACC TTG 2112 
Leu Lys Thr Asn He Leu Gin Tyr Ala Ser Thr Arg Pro Pro Thr Leu 
690 695 700 

TCA CCA ATA CCT CAC ATT CCT CGA AGC CCT TAC AAG TTT CCT AGT TCA 216 0 

Ser Pro He Pro His He Pro Arg Ser Pro Tyr Lys Phe Pro Ser Ser 
705 710 715 

CCC TTA CGG ATT CCT GGA GGG AAC ATC TAT ATT TCA CCC CTG AAG AGT 22 08 

Pro Leu Arg He Pro Gly Gly Asn He Tyr He Ser Pro Leu Lys Ser 
720 725 730 

CCA TAT AAA ATT TCA GAA GGT CTG CCA ACA CCA ACA AAA ATG ACT CCA 2256 
Pro Tyr Lys He Ser Glu Gly Leu Pro Thr Pro Thr Lys Met Thr Pro 
735 740 745 750 

AGA TCA AGA ATC TTA GTA TCA ATT GGT GAA TCA TTC GGG ACT TCT GAG 23 04 

Arg Ser Arg He Leu Val Ser He Gly Glu Ser Phe Gly Thr Ser Glu 
755 760 765 

AAG TTC CAG AAA ATA AAT CAG ATG GTA TGT AAC AGC GAC CGT GTG CTC 2352 
Lys Phe Gin Lys He Asn Gin Met Val Cys Asn Ser Asp Arg Val Leu 
770 775 780 

AAA AGA AGT GCT GAA GGA AGC AAC CCT CCT AAA CCA CTG AAA AAA CTA 24 00 

Lys Arg Ser Ala Glu Gly Ser Asn Pro Pro Lys Pro Leu Lys Lys Leu 
785 790 795 

CGC TTT GAT ATT GAA GGA TCA GAT GAA GCA GAT GGA AGT AAA CAT CTC 2448 
Arg Phe Asp He Glu Gly Ser Asp Glu Ala Asp Gly Ser Lys His Leu 
800 805 810 

CCA GGA GAG TCC AAA TTT CAG CAG AAA CTG GCA GAA ATG ACT TCT ACT 2496 
Pro Gly Glu Ser Lys Phe Gin Gin Lys Leu Ala Glu Met Thr Ser Thr 
815 820 825 830 
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CGA ACA CGA ATG CAA AAG CAG AAA ATG AAT GAT AGC ATG GAT ACC TCA 2544 
Arg Thr Arg Met Gin Lys Gin Lys Met Asn Asp Ser Met Asp Thr Ser 
835 840 845 

AAC AAG GAA GAG AAA TGAGGATCTC AGGACCTTGG TGGACACTGT GTACACCTCT 2599 
Asn Lys Glu Glu Lys 
850 

GGATTCATTG TCTCTCACAG ATGTGACTGT ATAACTTTCC CAGGTTCTGT TTATGGCCAC 26 5 9 

ATTTAATATC TTCAGCTCTT TTTGTGGATA TAAAATGTGC AGATGCAATT GTTTGGGTGA 2 719 

TTCCTAAGCC ACTTGAAATG TTAGTCATTG TTATTTATAC AAGATTGAAA ATCTTGTGTA 2 77 9 

AATCCTGCCA TTTAAAAAGT TGTAGCAGAT TGTTTCCTGT TCCAAAGTAA AATTGCTGTG 283 9 

CTTTATGGAT AGTAAGAATG GCCCTAGAGT GGGAGTCCTG ATAACCCAGG CCTGTCTGAC 2899 

TACTTTGCCT TCTTTTGTAG CATATAGGTG ATGTTTGCTC TTGTTTTTAT TAATTTATAT 2 959 

GTATATTTTT TTAATTTAAC ATGAACACCC TTAGAAAATG TGTCCTATCT ATCTTCCAAA 3 019 

TGCAATTTGA TTGACTGCCC ATTCACCAAA ATTATCCTGA ACTCTTCTGC AAAAATGGAT 3 079 

ATTATTAGAA ATTAGAAAAA AATTACTAAT TTTACACATT AGATTTTATT TTACTATTGG 3139 

AATCTGATAT ACTGTGTGCT TGTTTTATAA AATTTTGCTT TTAATTAAAT AAAAGCTGGA 3199 

AGCAAAGTAT AAC CAT ATG A TACTATCATA CTACTGAAAC AGATTTCATA CCTCAGAATG 325 9 

TAAAAGAACT TACTGATTAT TTTCTTCATC CAACTTATGT TTTTAAATGA GGATTATTGA 3319 

TAGT 3323 



(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 851 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 

Met Glu Lys Val Ser Ser Val Asp Gly Val Leu Gly Gly Tyr lie Gin 
15 10 15 

Lys Lys Lys Glu Leu Trp Gly lie Cys lie Phe lie Ala Ala Val Asp 
20 25 30 

Leu Asp Glu Met Ser Phe Thr Phe Thr Glu Leu Gin Lys Asn lie Glu 
35 40 45 
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He Ser Val His 
50 

Thr Lys Val Asp 
65 

Leu Phe Ala Leu 



Leu Thr Gin Pro 
100 

Val Leu Lys Val 
115 

Leu Gin Met Glu 
130 

Val Leu Asp Tyr 
145 

Pro Tyr Lys Thr 



Arg Arg Gly Gin 
180 

Asp Thr Arg He 
195 

Asp Glu Val Lys 
210 

Ser Leu Gly Leu 
225 

Ser Lys Arg Tyr 



Arg Leu Phe Leu 
260 

Ser Phe Glu Thr 
275 

Val Asn Val He 
290 

He Gin Gin Leu 
305 

Glu Asn Leu He 



Lys Phe Phe Asn 
55 

Asn Ala Met Ser 
70 

Phe Ser Lys Leu 
85 

Ser Ser Ser He 



Ser Trp He Thr 
120 

Asp Asp Leu Val 
135 

Phe He Lys Leu 
150 

Ala Val He Pro 
165 

Asn Arg Ser Ala 



He Glu Val Leu 
200 

Asn Val Tyr Phe 
215 

Val Thr Ser Asn 
230 

Glu Glu He Tyr 
245 

Asp His Asp Lys 



Gin Arg Thr Pro 
280 

Pro Pro His Thr 
295 

Met Met He Leu 
310 

Ser Tyr Phe Asn 
325 



Leu Leu Lys Glu 
60 

Arg Leu Leu Lys 
75 

Glu Arg Thr Cys 
90 

Ser Thr Glu He 
105 

Phe Leu Leu Ala 



He Ser Phe Gin 
140 

Ser Pro Pro Met 
155 

He Asn Gly Ser 
170 

Arg He Ala Lys 
185 

Cys Lys Glu His 



Lys Asn Phe He 
220 

Gly Leu Pro Glu 
235 

Leu Lys Asn Lys 
250 

Thr Leu Gin Thr 
265 

Arg Lys Ser Asn 



Pro Val Arg Thr 
300 

Asn Ser Ala Ser 
315 

Asn Cys Thr Val 
330 



He Asp Thr Ser 



Lys Tyr Asp Val 
80 

Glu Leu He Tyr 
95 

Asn Ser Ala Leu 
110 

Lys Gly Glu Val 
125 

Leu Met Leu Cys 



Leu Leu Lys Glu 
160 

Pro Arg Thr Pro 
175 

Gin Leu Glu Asn 
190 

Glu Cys Asn He 
205 

Pro Phe Met Asn 



Val Glu Asn Leu 
240 

Asp Leu Asp Ala 
255 

Asp Ser He Asp 
270 

Leu Asp Glu Glu 
285 

Val Met Asn Thr 



Asp Gin Pro Ser 
320 

Asn Pro Lys Glu 
335 
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Ser lie Leu Lys Arg Val Lys Asp lie Gly Tyr lie Phe Lys Glu Lys 
340 345 350 



Phe Ala Lys Ala Val Gly Gin Gly Cys Val Glu He Gly Ser Gin Arg 
355 360 365 

Tyr Lys Leu Gly Val Arg Leu Tyr Tyr Arg Val Met Glu Ser Met Leu 
370 375 380 

Lys Ser Glu Glu Glu Arg Leu Ser He Gin Asn Phe Ser Lys Leu Leu 
385 390 395 400 

Asn Asp Asn He Phe His Met Ser Leu Leu Ala Cys Ala Leu Glu Val 
405 410 415 

Val Met Ala Thr Tyr Ser Arg Ser Thr Ser Gin Asn Leu Asp Ser Gly 
420 425 430 

Thr Asp Leu Ser Phe Pro Trp He Leu Asn Val Leu Asn Leu Lys Ala 
435 440 445 

Phe Asp Phe Tyr Lys Val He Glu Ser Phe He Lys Ala Glu Gly Asn 
450 455 460 

Leu Thr Arg Glu Met He Lys His Leu Glu Arg Cys Glu His Arg He 
465 470 475 480 

Met Glu Ser Leu Ala Trp Leu Ser Asp Ser Pro Leu Phe Asp Leu He 
485 490 495 

Lys Gin Ser Lys Asp Arg Glu Gly Pro Thr Asp His Leu Glu Ser Ala 
500 505 510 

Cys Pro Leu Asn Leu Pro Leu Gin Asn Asn His Thr Ala Ala Asp Met 
515 520 525 

Tyr Leu Ser Pro Val Arg Ser Pro Lys Lys Lys Gly Ser Thr Thr Arg 
530 535 540 

Val Asn Ser Thr Ala Asn Ala Glu Thr Gin Ala Thr Ser Ala Phe Gin 
545 550 555 560 

Thr Gin Lys Pro Leu Lys Ser Thr Ser Leu Ser Leu Phe Tyr Lys Lys 
565 570 575 

Val Tyr Arg Leu Ala Tyr Leu Arg Leu Asn Thr Leu Cys Glu Arg Leu 
580 585 590 

Leu Ser Glu His Pro Glu Leu Glu His He He Trp Thr Leu Phe Gin 
595 600 605 

His Thr Leu Gin Asn Glu Tyr Glu Leu Met Arg Asp Arg His Leu Asp 
610 615 620 
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Gin lie Met Met Cys Ser Met Tyr Gly lie Cys Lys Val Lys Asn lie 
625 630 635 640 



Asp Leu Lys Phe Lys lie lie Val Thr Ala Tyr Lys Asp Leu Pro His 
645 650 655 

Ala Val Gin Glu Thr Phe Lys Arg Val Leu lie Lys Glu Glu Glu Tyr 
660 665 670 

Asp Ser lie lie Val Phe Tyr Asn Ser Val Phe Met Gin Arg Leu Lys 
675 680 685 

Thr Asn lie Leu Gin Tyr Ala Ser Thr Arg Pro Pro Thr Leu Ser Pro 
690 695 700 

lie Pro His lie Pro Arg Ser Pro Tyr Lys Phe Pro Ser Ser Pro Leu 
705 710 715 720 

Arg lie Pro Gly Gly Asn lie Tyr lie Ser Pro Leu Lys Ser Pro Tyr 
725 730 735 

Lys lie Ser Glu Gly Leu Pro Thr Pro Thr Lys Met Thr Pro Arg Ser 
740 745 750 

Arg lie Leu Val Ser lie Gly Glu Ser Phe Gly Thr Ser Glu Lys Phe 
755 760 765 

Gin Lys lie Asn Gin Met Val Cys Asn Ser Asp Arg Val Leu Lys Arg 
770 775 780 

Ser Ala Glu Gly Ser Asn Pro Pro Lys Pro Leu Lys Lys Leu Arg Phe 
785 790 795 800 

Asp lie Glu Gly Ser Asp Glu Ala Asp Gly Ser Lys His Leu Pro Gly 
805 810 815 

Glu Ser Lys Phe Gin Gin Lys Leu Ala Glu Met Thr Ser Thr Arg Thr 
820 825 830 

Arg Met Gin Lys Gin Lys Met Asn Asp Ser Met Asp Thr Ser Asn Lys 
835 840 845 

Glu Glu Lys 
850 



(2) INFORMATION FOR SEQ ID NO: 34: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3266 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



-174- 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 7.. 2 5 02 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

GCCATC ATG CTG TGG GGA ATC TGT ATC TTT ATT GCA GCA GTT GAC CTA 48 
Met Leu Trp Gly lie Cys lie Phe He Ala Ala Val Asp Leu 
15 10 

GAT GAG ATG TCG TTC ACT TTT ACT GAG CTA CAG AAA AAC ATA GAA ATC 96 
Asp Glu Met Ser Phe Thr Phe Thr Glu Leu Gin Lys Asn He Glu He 
15 20 25 30 

AGT GTC CAT AAA TTC TTT AAC TTA CTA AAA GAA ATT GAT ACC AGT ACC 144 
Ser Val His Lys Phe Phe Asn Leu Leu Lys Glu He Asp Thr Ser Thr 
35 40 45 

AAA GTT GAT AAT GCT ATG TCA AGA CTG TTG AAG AAG TAT GAT GTA TTG 192 
Lys Val Asp Asn Ala Met Ser Arg Leu Leu Lys Lys Tyr Asp Val Leu 
50 55 60 

TTT GCA CTC TTC AGC AAA TTG GAA AGG ACA TGT GAA CTT ATA TAT TTG 240 
Phe Ala Leu Phe Ser Lys Leu Glu Arg Thr Cys Glu Leu He Tyr Leu 
65 70 75 

ACA CAA CCC AGC AGT TCG ATA TCT ACT GAA ATA AAT TCT GCA TTG GTG 288 
Thr Gin Pro Ser Ser Ser He Ser Thr Glu He Asn Ser Ala Leu Val 
80 85 90 

CTA AAA GTT TCT TGG ATC ACA TTT TTA TTA GCT AAA GGG GAA GTA TTA 336 
Leu Lys Val Ser Trp He Thr Phe Leu Leu Ala Lys Gly Glu Val Leu 
95 100 105 110 

CAA ATG GAA GAT GAT CTG GTG ATT TCA TTT CAG TTA ATG CTA TGT GTC 384 
Gin Met Glu Asp Asp Leu Val He Ser Phe Gin Leu Met Leu Cys Val 
115 120 125 

CTT GAC TAT TTT ATT AAA CTC TCA CCT CCC ATG TTG CTC AAA GAA CCA 432 
Leu Asp Tyr Phe He Lys Leu Ser Pro Pro Met Leu Leu Lys Glu Pro 
130 135 140 

TAT AAA ACA GCT GTT ATA CCC ATT AAT GGT TCA CCT CGA ACA CCC AGG 48 0 

Tyr Lys Thr Ala Val He Pro He Asn Gly Ser Pro Arg Thr Pro Arg 
145 150 155 

CGA GGT CAG AAC AGG AGT GCA CGG ATA GCA AAA CAA CTA GAA AAT GAT 528 
Arg Gly Gin Asn Arg Ser Ala Arg He Ala Lys Gin Leu Glu Asn Asp 
160 165 170 

ACA AGA ATT ATT GAA GTT CTC TGT AAA GAA CAT GAA TGT AAT ATA GAT 576 
Thr Arg He He Glu Val Leu Cys Lys Glu His Glu Cys Asn He Asp 
175 180 185 190 
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GAG GTG AAA AAT GTT TAT TTC AAA AAT TTT ATA CCT TTT ATG AAT TCT 624 

Glu Val Lys Asn Val Tyr Phe Lys Asn Phe He Pro Phe Met Asn Ser 

195 200 205 

CTT GGA CTT GTA ACA TCT AAT GGA CTT CCA GAG GTT GAA AAT CTT TCT 672 

Leu Gly Leu Val Thr Ser Asn Gly Leu Pro Glu Val Glu Asn Leu Ser 

210 215 220 

AAA CGA TAC GAA GAA ATT TAT CTT AAA AAT AAA GAT CTA GAT GCA AGA 720 

Lys Arg Tyr Glu Glu He Tyr Leu Lys Asn Lys Asp Leu Asp Ala Arg 

225 230 235 

TTA TTT TTG GAT CAT GAT AAA ACT CTT CAG ACT GAT TCT ATA GAC AGT 768 

Leu Phe Leu Asp His Asp Lys Thr Leu Gin Thr Asp Ser He Asp Ser 
240 245 250 

TTT GAA ACA CAG AGA ACA CCA CGA AAA AGT AAC CTT GAT GAA GAG GTG 816 
Phe Glu Thr Gin Arg Thr Pro Arg Lys Ser Asn Leu Asp Glu Glu Val 

255 260 265 270 

AAT GTA ATT CCT CCA CAC ACT CCA GTT AGG ACT GTT ATG AAC ACT ATC 864 

Asn Val He Pro Pro His Thr Pro Val Arg Thr Val Met Asn Thr He 

275 280 285 

CAA CAA TTA ATG ATG ATT TTA AAT TCA GCA AGT GAT CAA CCT TCA GAA 912 

Gin Gin Leu Met Met He Leu Asn Ser Ala Ser Asp Gin Pro Ser Glu 

290 295 300 

AAT CTG ATT TCC TAT TTT AAC AAC TGC ACA GTG AAT CCA AAA GAA AGT 96 0 

Asn Leu He Ser Tyr Phe Asn Asn Cys Thr Val Asn Pro Lys Glu Ser 

305 310 315 

ATA CTG AAA AGA GTG AAG GAT ATA GGA TAC ATC TTT AAA GAG AAA TTT 1008 

He Leu Lys Arg Val Lys Asp He Gly Tyr He Phe Lys Glu Lys Phe 
320 325 330 

GCT AAA GCT GTG GGA CAG GGT TGT GTC GAA ATT GGA TCA CAG CGA TAC 1056 

Ala Lys Ala Val Gly Gin Gly Cys Val Glu He Gly Ser Gin Arg Tyr 

335 340 345 350 

AAA CTT GGA GTT CGC TTG TAT TAC CGA GTA ATG GAA TCC ATG CTT AAA 1104 

Lys Leu Gly Val Arg Leu Tyr Tyr Arg Val Met Glu Ser Met Leu Lys 

355 360 365 

TCA GAA GAA GAA CGA TTA TCC ATT CAA AAT TTT AGC AAA CTT CTG AAT 1152 

Ser Glu Glu Glu Arg Leu Ser He Gin Asn Phe Ser Lys Leu Leu Asn 

370 375 380 

GAC AAC ATT TTT CAT ATG TCT TTA TTG GCG TGC GCT CTT GAG GTT GTA 12 00 

Asp Asn He Phe His Met Ser Leu Leu Ala Cys Ala Leu Glu Val Val 

385 390 395 
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ATG GCC ACA TAT AGC AGA AGT ACA TCT CAG AAT CTT GAT TCT GGA ACA 124 8 

Met Ala Thr Tyr Ser Arg Ser Thr Ser Gin Asn Leu Asp Ser Gly Thr 
400 405 410 

GAT TTG TCT TTC CCA TGG ATT CTG AAT GTG CTT AAT TTA AAA GCC TTT 1296 

Asp Leu Ser Phe Pro Trp lie Leu Asn Val Leu Asn Leu Lys Ala Phe 
415 420 425 430 

GAT TTT TAC AAA GTG ATC GAA AGT TTT ATC AAA GCA GAA GGC AAC TTG 1344 

Asp Phe Tyr Lys Val lie Glu Ser Phe lie Lys Ala Glu Gly Asn Leu 
435 440 445 

ACA AGA GAA ATG ATA AAA CAT TTA GAA CGA TGT GAA CAT CGA ATC ATG 1392 

Thr Arg Glu Met lie Lys His Leu Glu Arg Cys Glu His Arg lie Met 
450 455 460 

GAA TCC CTT GCA TGG CTC TCA GAT TCA CCT TTA TTT GAT CTT ATT AAA 144 0 

Glu Ser Leu Ala Trp Leu Ser Asp Ser Pro Leu Phe Asp Leu lie Lys 

465 470 475 

CAA TCA AAG GAC CGA GAA GGA CCA ACT GAT CAC CTT GAA TCT GCT TGT 148 8 

Gin Ser Lys Asp Arg Glu Gly Pro Thr Asp His Leu Glu Ser Ala Cys 
480 485 490 

CCT CTT AAT CTT CCT CTC CAG AAT AAT CAC ACT GCA GCA GAT ATG TAT 1536 

Pro Leu Asn Leu Pro Leu Gin Asn Asn His Thr Ala Ala Asp Met Tyr 
495 500 505 510 

CTT TCT CCT GTA AGA TCT CCA AAG AAA AAA GGT TCA ACT ACG CGT GTA 1584 

Leu Ser Pro Val Arg Ser Pro Lys Lys Lys Gly Ser Thr Thr Arg Val 
515 520 525 

AAT TCT ACT GCA AAT GCA GAG ACA CAA GCA ACC TCA GCC TTC CAG ACC 1632 

Asn Ser Thr Ala Asn Ala Glu Thr Gin Ala Thr Ser Ala Phe Gin Thr 
530 535 540 

CAG AAG CCA TTG AAA TCT ACC TCT CTT TCA CTG TTT TAT AAA AAA GTG 1680 

Gin Lys Pro Leu Lys Ser Thr Ser Leu Ser Leu Phe Tyr Lys Lys Val 

545 550 555 

TAT CGG CTA GCC TAT CTC CGG CTA AAT ACA CTT TGT GAA CGC CTT CTG 1728 

Tyr Arg Leu Ala Tyr Leu Arg Leu Asn Thr Leu Cys Glu Arg Leu Leu 
560 565 570 

TCT GAG CAC CCA GAA TTA GAA CAT ATC ATC TGG ACC CTT TTC CAG CAC 1776 

Ser Glu His Pro Glu Leu Glu His He He Trp Thr Leu Phe Gin His 
575 580 585 590 

ACC CTG CAG AAT GAG TAT GAA CTC ATG AGA GAC AGG CAT TTG GAC CAA 1824 

Thr Leu Gin Asn Glu Tyr Glu Leu Met Arg Asp Arg His Leu Asp Gin 
595 600 605 
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ATT ATG ATG TGT TCC ATG TAT GGC ATA TGC AAA GTG AAG AAT ATA GAC 1872 
He Met Met Cys Ser Met Tyr Gly He Cys Lys Val Lys Asn He Asp 
610 615 620 

CTT AAA TTC AAA ATC ATT GTA ACA GCA TAG AAG GAT CTT CCT CAT GCT 192 0 

Leu Lys Phe Lys He He Val Thr Ala Tyr Lys Asp Leu Pro His Ala 
625 630 635 

GTT CAG GAG ACA TTC AAA CGT GTT TTG ATC AAA GAA GAG GAG TAT GAT 196 8 

Val Gin Glu Thr Phe Lys Arg Val Leu He Lys Glu Glu Glu Tyr Asp 
640 645 650 

TCT ATT ATA GTA TTC TAT AAC TCG GTC TTC ATG CAG AGA CTG AAA ACA 2 016 

Ser He He Val Phe Tyr Asn Ser Val Phe Met Gin Arg Leu Lys Thr 
655 660 665 670 

AAT ATT TTG CAG TAT GCT TCC ACC AGG CCC CCT ACC TTG TCA CCA ATA 2 064 

Asn He Leu Gin Tyr Ala Ser Thr Arg Pro Pro Thr Leu Ser Pro He 
675 680 685 

CCT CAC ATT CCT CGA AGC CCT TAC AAG TTT CCT AGT TCA CCC TTA CGG 2112 
Pro His He Pro Arg Ser Pro Tyr Lys Phe Pro Ser Ser Pro Leu Arg 
690 695 700 

ATT CCT GGA GGG AAC ATC TAT ATT TCA CCC CTG AAG AGT CCA TAT AAA 2160 
He Pro Gly Gly Asn He Tyr He Ser Pro Leu Lys Ser Pro Tyr Lys 
705 710 715 

ATT TCA GAA GGT CTG CCA ACA CCA ACA AAA ATG ACT CCA AGA TCA AGA 22 08 

He Ser Glu Gly Leu Pro Thr Pro Thr Lys Met Thr Pro Arg Ser Arg 
720 725 730 

ATC TTA GTA TCA ATT GGT GAA TCA TTC GGG ACT TCT GAG AAG TTC CAG 22 56 

He Leu Val Ser He Gly Glu Ser Phe Gly Thr Ser Glu Lys Phe Gin 
735 740 745 750 

AAA ATA AAT CAG ATG GTA TGT AAC AGC GAC CGT GTG CTC AAA AGA AGT 2304 
Lys He Asn Gin Met Val Cys Asn Ser Asp Arg Val Leu Lys Arg Ser 
755 760 765 

GCT GAA GGA AGC AAC CCT CCT AAA CCA CTG AAA AAA CTA CGC TTT GAT 23 52 

Ala Glu Gly Ser Asn Pro Pro Lys Pro Leu Lys Lys Leu Arg Phe Asp 
770 775 780 

ATT GAA GGA TCA GAT GAA GCA GAT GGA AGT AAA CAT CTC CCA GGA GAG 24 00 

He Glu Gly Ser Asp Glu Ala Asp Gly Ser Lys His Leu Pro Gly Glu 
785 790 795 

TCC AAA TTT CAG CAG AAA CTG GCA GAA ATG ACT TCT ACT CGA ACA CGA 2448 
Ser Lys Phe Gin Gin Lys Leu Ala Glu Met Thr Ser Thr Arg Thr Arg 
800 805 810 
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ATG CAA AAG CAG AAA ATG AAT GAT AGC ATG GAT ACC TCA AAC AAG GAA 24 96 

Met Gin Lys Gin Lys Met Asn Asp Ser Met Asp Thr Ser Asn Lys Glu 
815 820 825 830 

GAG AAA TGAGGATCTC AGGACCTTGG TGGACACTGT GTACACCTCT GGATTCATTG 2552 
Glu Lys 



TCTCTCACAG ATGTGACTGT ATAACTTTCC CAGGTTCTGT TTATGGCCAC ATTTAATATC 2 612 

TTCAGCTCTT TTTGTGGATA TAAAATGTGC AGATGCAATT GTTTGGGTGA TTCCTAAGCC 2 672 

ACTTGAAATG TTAGTCATTG TTATTTATAC AAGATTGAAA ATCTTGTGTA AATCCTGCCA 2 732 

TTTAAAAAGT TGTAGCAGAT TGTTTCCTCT TCCAAAGTAA AATTGCTGTG CTTTATGGAT 2 792 

AGTAAGAATG GCCCTAGAGT GGGAGTCCTG ATAACCCAGG CCTGTCTGAC TACTTTGCCT 2 852 

TCTTTTGTAG CATATAGGTG ATGTTTGCTC TTGTTTTTAT TAATTTATAT GTATATTTTT 2912 

TTAATTTAAC ATGAACACCC TTAGAAAATG TGTCCTATCT ATCTTCCAAA TGCAATTTGA 2972 

TTGACTGCCC ATTCACCAAA ATTATCCTGA ACTCTTCTGC AAAAATGGAT ATTATTAGAA 3 032 

ATTAGAAAAA AATTACTAAT TTTACACATT AGATTTTATT TTACTATTGG AATCTGATAT 3 092 

ACTGTGTGCT TGTTTTATAA AATTTTGCTT TTAATTAAAT AAAAGCTGGA AGCAAAGTAT 3152 

AACCATATGA TACTATCATA CTACTGAAAC AGATTTCATA CCTCAGAATG TAAAAGAACT 3212 

TACTGATTAT TTTCTTCATC CAACTTATGT TTTTAAATGA GGATTATTGA TAGT 3266 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 832 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

Met Leu Trp Gly He Cys He Phe He Ala Ala Val Asp Leu Asp Glu 
1 5 10 15 

Met Ser Phe Thr Phe Thr Glu Leu Gin Lys Asn He Glu He Ser Val 
20 25 30 

His Lys Phe Phe Asn Leu Leu Lys Glu He Asp Thr Ser Thr Lys Val 
35 40 45 
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Asp Asn Ala Met Ser Arg Leu Leu Lys Lys Tyr Asp Val Leu Phe Ala 
50 55 60 



Leu Phe Ser Lys Leu Glu Arg Thr Cys Glu Leu lie Tyr Leu Thr Gin 
65 70 75 80 

Pro Ser Ser Ser lie Ser Thr Glu lie Asn Ser Ala Leu Val Leu Lys 
85 90 95 

Val Ser Trp He Thr Phe Leu Leu Ala Lys Gly Glu Val Leu Gin Met 
100 105 110 

Glu Asp Asp Leu Val He Ser Phe Gin Leu Met Leu Cys Val Leu Asp 
115 120 125 

Tyr Phe He Lys Leu Ser Pro Pro Met Leu Leu Lys Glu Pro Tyr Lys 
130 135 140 

Thr Ala Val He Pro He Asn Gly Ser Pro Arg Thr Pro Arg Arg Gly 
145 150 155 160 

Gin Asn Arg Ser Ala Arg He Ala Lys Gin Leu Glu Asn Asp Thr Arg 
165 170 175 

He He Glu Val Leu Cys Lys Glu His Glu Cys Asn He Asp Glu Val 
180 185 190 

Lys Asn Val Tyr Phe Lys Asn Phe He Pro Phe Met Asn Ser Leu Gly 
195 200 205 

Leu Val Thr Ser Asn Gly Leu Pro Glu Val Glu Asn Leu Ser Lys Arg 
210 215 220 

Tyr Glu Glu He Tyr Leu Lys Asn Lys Asp Leu Asp Ala Arg Leu Phe 
225 230 235 240 

Leu Asp His Asp Lys Thr Leu Gin Thr Asp Ser He Asp Ser Phe Glu 
245 250 255 

Thr Gin Arg Thr Pro Arg Lys Ser Asn Leu Asp Glu Glu Val Asn Val 

260 265 270 

He Pro Pro His Thr Pro Val Arg Thr Val Met Asn Thr He Gin Gin 
275 280 285 

Leu Met Met He Leu Asn Ser Ala Ser Asp Gin Pro Ser Glu Asn Leu 
290 295 300 

He Ser Tyr Phe Asn Asn Cys Thr Val Asn Pro Lys Glu Ser He Leu 
305 310 315 320 

Lys Arg Val Lys Asp He Gly Tyr He Phe Lys Glu Lys Phe Ala Lys 
325 330 335 
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Ala Val Gly Gin Gly Cys Val Glu He Gly Ser Gin Arg Tyr Lys Leu 
340 345 350 



Gly Val Arg Leu Tyr Tyr Arg Val Met Glu Ser Met Leu Lys Ser Glu 
355 360 365 

Glu Glu Arg Leu Ser He Gin Asn Phe Ser Lys Leu Leu Asn Asp Asn 
370 375 380 

He Phe His Met Ser Leu Leu Ala Cys Ala Leu Glu Val Val Met Ala 
385 390 395 400 

Thr Tyr Ser Arg Ser Thr Ser Gin Asn Leu Asp Ser Gly Thr Asp Leu 
405 410 415 

Ser Phe Pro Trp He Leu Asn Val Leu Asn Leu Lys Ala Phe Asp Phe 
420 425 430 

Tyr Lys Val He Glu Ser Phe He Lys Ala Glu Gly Asn Leu Thr Arg 
435 440 445 

Glu Met He Lys His Leu Glu Arg Cys Glu His Arg He Met Glu Ser 
450 455 460 

Leu Ala Trp Leu Ser Asp Ser Pro Leu Phe Asp Leu He Lys Gin Ser 
465 470 475 480 

Lys Asp Arg Glu Gly Pro Thr Asp His Leu Glu Ser Ala Cys Pro Leu 
485 490 495 

Asn Leu Pro Leu Gin Asn Asn His Thr Ala Ala Asp Met Tyr Leu Ser 
500 505 510 

Pro Val Arg Ser Pro Lys Lys Lys Gly Ser Thr Thr Arg Val Asn Ser 
515 520 525 

Thr Ala Asn Ala Glu Thr Gin Ala Thr Ser Ala Phe Gin Thr Gin Lys 
530 535 540 

Pro Leu Lys Ser Thr Ser Leu Ser Leu Phe Tyr Lys Lys Val Tyr Arg 
545 550 555 560 

Leu Ala Tyr Leu Arg Leu Asn Thr Leu Cys Glu Arg Leu Leu Ser Glu 
565 570 575 

His Pro Glu Leu Glu His He He Trp Thr Leu Phe Gin His Thr Leu 
580 585 590 

Gin Asn Glu Tyr Glu Leu Met Arg Asp Arg His Leu Asp Gin He Met 
595 600 605 

Met Cys Ser Met Tyr Gly He Cys Lys Val Lys Asn He Asp Leu Lys 
610 615 620 
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Phe Lys He He Val Thr Ala Tyr Lys Asp Leu Pro His Ala Val Gin 
625 630 635 640 



Glu Thr Phe Lys Arg Val Leu He Lys Glu Glu Glu Tyr Asp Ser He 
645 650 655 

He Val Phe Tyr Asn Ser Val Phe Met Gin Arg Leu Lys Thr Asn He 
660 665 670 

Leu Gin Tyr Ala Ser Thr Arg Pro Pro Thr Leu Ser Pro He Pro His 
675 680 685 

He Pro Arg Ser Pro Tyr Lys Phe Pro Ser Ser Pro Leu Arg He Pro 
690 695 700 

Gly Gly Asn He Tyr He Ser Pro Leu Lys Ser Pro Tyr Lys He Ser 
705 710 715 720 

Glu Gly Leu Pro Thr Pro Thr Lys Met Thr Pro Arg Ser Arg He Leu 
725 730 735 

Val Ser He Gly Glu Ser Phe Gly Thr Ser Glu Lys Phe Gin Lys He 
740 745 750 

Asn Gin Met Val Cys Asn Ser Asp Arg Val Leu Lys Arg Ser Ala Glu 
755 760 765 

Gly Ser Asn Pro Pro Lys Pro Leu Lys Lys Leu Arg Phe Asp He Glu 
770 775 780 

Gly Ser Asp Glu Ala Asp Gly Ser Lys His Leu Pro Gly Glu Ser Lys 
785 790 795 800 

Phe Gin Gin Lys Leu Ala Glu Met Thr Ser Thr Arg Thr Arg Met Gin 
805 810 815 

Lys Gin Lys Met Asn Asp Ser Met Asp Thr Ser Asn Lys Glu Glu Lys 
820 825 830 



(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3113 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 7.. 2349 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:36: 
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GCCGTC ATG TCA AGA CTG TTG AAG AAG TAT GAT GTA TTG TTT GCA CTC 48 
Met Ser Arg Leu Leu Lys Lys Tyr Asp Val Leu Phe Ala Leu 
1 5 10 

TTC AGC AAA TTG GAA AGG ACA TGT GAA CTT ATA TAT TTG ACA CAA CCC 96 
Phe Ser Lys Leu Glu Arg Thr Cys Glu Leu He Tyr Leu Thr Gin Pro 
15 20 25 30 

AGC AGT TCG ATA TCT ACT GAA ATA AAT TCT GCA TTG GTG CTA AAA GTT 144 
Ser Ser Ser He Ser Thr Glu He Asn Ser Ala Leu Val Leu Lys Val 
35 40 45 

TCT TGG ATC ACA TTT TTA TTA GCT AAA GGG GAA GTA TTA CAA ATG GAA 192 
Ser Trp He Thr Phe Leu Leu Ala Lys Gly Glu Val Leu Gin Met Glu 
50 55 60 

GAT GAT CTG GTG ATT TCA TTT CAG TTA ATG CTA TGT GTC CTT GAC TAT 240 
Asp Asp Leu Val He Ser Phe Gin Leu Met Leu Cys Val Leu Asp Tyr 
65 70 75 

TTT ATT AAA CTC TCA CCT CCC ATG TTG CTC AAA GAA CCA TAT AAA ACA 288 
Phe He Lys Leu Ser Pro Pro Met Leu Leu Lys Glu Pro Tyr Lys Thr 
80 85 90 

GCT GTT ATA CCC ATT AAT GGT TCA CCT CGA ACA CCC AGG CGA GGT CAG 336 
Ala Val He Pro He Asn Gly Ser Pro Arg Thr Pro Arg Arg Gly Gin 
95 100 105 110 

AAC AGG AGT GCA CGG ATA GCA AAA CAA CTA GAA AAT GAT ACA AGA ATT 384 
Asn Arg Ser Ala Arg He Ala Lys Gin Leu Glu Asn Asp Thr Arg He 
115 120 125 

ATT GAA GTT CTC TGT AAA GAA CAT GAA TGT AAT ATA GAT GAG GTG AAA 432 
He Glu Val Leu Cys Lys Glu His Glu Cys Asn He Asp Glu Val Lys 
130 135 140 

AAT GTT TAT TTC AAA AAT TTT ATA CCT TTT ATG AAT TCT CTT GGA CTT 480 
Asn Val Tyr Phe Lys Asn Phe He Pro Phe Met Asn Ser Leu Gly Leu 
145 150 155 

GTA ACA TCT AAT GGA CTT CCA GAG GTT GAA AAT CTT TCT AAA CGA TAC 528 
Val Thr Ser Asn Gly Leu Pro Glu Val Glu Asn Leu Ser Lys Arg Tyr 
1^0 165 170 

GAA GAA ATT TAT CTT AAA AAT AAA GAT CTA GAT GCA AGA TTA TTT TTG 576 
Glu Glu He Tyr Leu Lys Asn Lys Asp Leu Asp Ala Arg Leu Phe Leu 
175 180 185 190 

GAT CAT GAT AAA ACT CTT CAG ACT GAT TCT ATA GAC AGT TTT GAA ACA 624 
Asp His Asp Lys Thr Leu Gin Thr Asp Ser He Asp Ser Phe Glu Thr 
195 200 205 
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CAG AGA ACA CCA CGA AAA AGT AAC CTT GAT GAA GAG GTG AAT GTA ATT 672 
Gin Arg Thr Pro Arg Lys Ser Asn Leu Asp Glu Glu Val Asn Val lie 
210 215 220 

CCT CCA CAC ACT CCA GTT AGG ACT GTT ATG AAC ACT ATC CAA CAA TTA 72 0 

Pro Pro His Thr Pro Val Arg Thr Val Met Asn Thr lie Gin Gin Leu 
225 230 235 

ATG ATG ATT TTA AAT TCA GCA AGT GAT CAA CCT TCA GAA AAT CTG ATT 768 
Met Met lie Leu Asn Ser Ala Ser Asp Gin Pro Ser Glu Asn Leu lie 
240 245 250 

TCC TAT TTT AAC AAC TGC ACA GTG AAT CCA AAA GAA AGT ATA CTG AAA 816 
Ser Tyr Phe Asn Asn Cys Thr Val Asn Pro Lys Glu Ser lie Leu Lys 
255 260 265 270 

AGA GTG AAG GAT ATA GGA TAC ATC TTT AAA GAG AAA TTT GCT AAA GCT 864 
Arg Val Lys Asp lie Gly Tyr He Phe Lys Glu Lys Phe Ala Lys Ala 
275 280 285 

GTG GGA CAG GGT TGT GTC GAA ATT GGA TCA CAG CGA TAC AAA CTT GGA 912 
Val Gly Gin Gly Cys Val Glu He Gly Ser Gin Arg Tyr Lys Leu Gly 
290 295 300 

GTT CGC TTG TAT TAC CGA GTA ATG GAA TCC ATG CTT AAA TCA GAA GAA 960 
Val Arg Leu Tyr Tyr Arg Val Met Glu Ser Met Leu Lys Ser Glu Glu 
305 310 315 

GAA CGA TTA TCC ATT CAA AAT TTT AGC AAA CTT CTG AAT GAC AAC ATT 1008 
Glu Arg Leu Ser He Gin Asn Phe Ser Lys Leu Leu Asn Asp Asn He 
320 325 330 

TTT CAT ATG TCT TTA TTG GCG TGC GCT CTT GAG GTT GTA ATG GCC ACA 1056 
Phe His Met Ser Leu Leu Ala Cys Ala Leu Glu Val Val Met Ala Thr 
335 340 345 350 

TAT AGC AGA AGT ACA TCT CAG AAT CTT GAT TCT GGA ACA GAT TTG TCT 1104 
Tyr Ser Arg Ser Thr Ser Gin Asn Leu Asp Ser Gly Thr Asp Leu Ser 
355 360 365 

TTC CCA TGG ATT CTG AAT GTG CTT AAT TTA AAA GCC TTT GAT TTT TAC 1152 
Phe Pro Trp He Leu Asn Val Leu Asn Leu Lys Ala Phe Asp Phe Tyr 
370 375 380 

AAA GTG ATC GAA AGT TTT ATC AAA GCA GAA GGC AAC TTG ACA AGA GAA 12 00 

Lys Val He Glu Ser Phe He Lys Ala Glu Gly Asn Leu Thr Arg Glu 
385 390 395 

ATG ATA AAA CAT TTA GAA CGA TGT GAA CAT CGA ATC ATG GAA TCC CTT 1248 
Met He Lys His Leu Glu Arg Cys Glu His Arg lie Met Glu Ser Leu 
400 405 410 
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GCA TGG CTC TCA GAT TCA CCT TTA TTT GAT CTT ATT AAA CAA TCA AAG 12 96 

Ala Trp Leu Ser Asp Ser Pro Leu Phe Asp Leu lie Lys Gin Ser Lys 
415 420 425 430 

GAC CGA GAA GGA CCA ACT GAT CAC CTT GAA TCT GCT TGT CCT CTT AAT 1344 
Asp Arg Glu Gly Pro Thr Asp His Leu Glu Ser Ala Cys Pro Leu Asn 
435 440 445 

CTT CCT CTC CAG AAT AAT CAC ACT GCA GCA GAT ATG TAT CTT TCT CCT 13 92 

Leu Pro Leu Gin Asn Asn His Thr Ala Ala Asp Met Tyr Leu Ser Pro 
450 455 460 

GTA AGA TCT CCA AAG AAA AAA GGT TCA ACT ACG CGT GTA AAT TCT ACT 1440 
Val Arg Ser Pro Lys Lys Lys Gly Ser Thr Thr Arg Val Asn Ser Thr 
465 470 475 

GCA AAT GCA GAG ACA CAA GCA ACC TCA GCC TTC CAG ACC CAG AAG CCA 1488 
Ala Asn Ala Glu Thr Gin Ala Thr Ser Ala Phe Gin Thr Gin Lys Pro 
480 485 490 

TTG AAA TCT ACC TCT CTT TCA CTG TTT TAT AAA AAA GTG TAT CGG CTA 1536 
Leu Lys Ser Thr Ser Leu Ser Leu Phe Tyr Lys Lys Val Tyr Arg Leu 
495 500 505 510 

GCC TAT CTC CGG CTA AAT ACA CTT TGT GAA CGC CTT CTG TCT GAG CAC 1584 
Ala Tyr Leu Arg Leu Asn Thr Leu Cys Glu Arg Leu Leu Ser Glu His 
515 520 525 

CCA GAA TTA GAA CAT ATC ATC TGG ACC CTT TTC CAG CAC ACC CTG CAG 1632 
Pro Glu Leu Glu His lie lie Trp Thr Leu Phe Gin His Thr Leu Gin 
530 535 540 

AAT GAG TAT GAA CTC ATG AGA GAC AGG CAT TTG GAC CAA ATT ATG ATG 168 0 

Asn Glu Tyr Glu Leu Met Arg Asp Arg His Leu Asp Gin He Met Met 
545 550 555 

TGT TCC ATG TAT GGC ATA TGC AAA GTG AAG AAT ATA GAC CTT AAA TTC 172 8 

Cys Ser Met Tyr Gly He Cys Lys Val Lys Asn He Asp Leu Lys Phe 
560 565 570 

AAA ATC ATT GTA ACA GCA TAC AAG GAT CTT CCT CAT GCT GTT CAG GAG 1776 
Lys He He Val Thr Ala Tyr Lys Asp Leu Pro His Ala Val Gin Glu 
575 580 585 590 

ACA TTC AAA CGT GTT TTG ATC AAA GAA GAG GAG TAT GAT TCT ATT ATA 1824 
Thr Phe Lys Arg Val Leu He Lys Glu Glu Glu Tyr Asp Ser He He 
595 600 605 

GTA TTC TAT AAC TCG GTC TTC ATG CAG AGA CTG AAA ACA AAT ATT TTG 1872 
Val Phe Tyr Asn Ser Val Phe Met Gin Arg Leu Lys Thr Asn He Leu 
610 615 620 
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CAG TAT GCT TCC ACC AGG CCC CCT ACC TTG TCA CCA ATA CCT CAC ATT 192 0 

Gin Tyr Ala Ser Thr Arg Pro Pro Thr Leu Ser Pro lie Pro His lie 
625 630 635 

CCT CGA AGC CCT TAC AAG TTT CCT AGT TCA CCC TTA CGG ATT CCT GGA 196 8 

Pro Arg Ser Pro Tyr Lys Phe Pro Ser Ser Pro Leu Arg lie Pro Gly 
640 645 650 

GGG AAC ATC TAT ATT TCA CCC CTG AAG AGT CCA TAT AAA ATT TCA GAA 2016 
Gly Asn lie Tyr lie Ser Pro Leu Lys Ser Pro Tyr Lys lie Ser Glu 
655 660 665 670 

GGT CTG CCA ACA CCA ACA AAA ATG ACT CCA AGA TCA AGA ATC TTA GTA 2 064 

Gly Leu Pro Thr Pro Thr Lys Met Thr Pro Arg Ser Arg lie Leu Val 
675 680 685 

TCA ATT GGT GAA TCA TTC GGG ACT TCT GAG AAG TTC CAG AAA ATA AAT 2112 
Ser lie Gly Glu Ser Phe Gly Thr Ser Glu Lys Phe Gin Lys lie Asn 
690 695 700 

CAG ATG GTA TGT AAC AGC GAC CGT GTG CTC AAA AGA AGT GCT GAA GGA 2160 
Gin Met Val Cys Asn Ser Asp Arg Val Leu Lys Arg Ser Ala Glu Gly 
705 710 715 

AGC AAC CCT CCT AAA CCA CTG AAA AAA CTA CGC TTT GAT ATT GAA GGA 22 08 

Ser Asn Pro Pro Lys Pro Leu Lys Lys Leu Arg Phe Asp lie Glu Gly 
720 725 730 

TCA GAT GAA GCA GAT GGA AGT AAA CAT CTC CCA GGA GAG TCC AAA TTT 2256 
Ser Asp Glu Ala Asp Gly Ser Lys His Leu Pro Gly Glu Ser Lys Phe 
735 740 745 750 

CAG CAG AAA CTG GCA GAA ATG ACT TCT ACT CGA ACA CGA ATG CAA AAG 23 04 

Gin Gin Lys Leu Ala Glu Met Thr Ser Thr Arg Thr Arg Met Gin Lys 
755 760 765 

CAG AAA ATG AAT GAT AGC ATG GAT ACC TCA AAC AAG GAA GAG AAA 2349 
Gin Lys Met Asn Asp Ser Met Asp Thr Ser Asn Lys Glu Glu Lys 
770 775 780 

TGAGGATCTC AGGACCTTGG TGGACACTGT GTACACCTCT GGATTCATTG TCTCTCACAG 2409 

ATGTGACTGT ATAACTTTCC CAGGTTCTGT TTATGGCCAC ATTTAATATC TTCAGCTCTT 2469 

TTTGTGGATA TAAAATGTGC AGATGCAATT GTTTGGGTGA TTCCTAAGCC ACTTGAAATG 252 9 

TTAGTCATTG TTATTTATAC AAGATTGAAA ATCTTGTGTA AATCCTGCCA TTTAAAAAGT 2589 

TGTAGCAGAT TGTTTCCTCT TCCAAAGTAA AATTGCTGTG CTTTATGGAT AGTAAGAATG 264 9 

GCCCTAGAGT GGGAGTCCTG ATAACCCAGG CCTGTCTGAC TACTTTGCCT TCTTTTGTAG 2 709 

CATATAGGTG ATGTTTGCTC TTGTTTTTAT TAATTTATAT GTATATTTTT TTAATTTAAC 2769 
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ATGAACACCC TTAGAAAATG TGTCCTATCT ATCTTCCAAA TGCAATTTGA TTGACTGCCC 282 9 

ATTCACCAAA ATTATCCTGA ACTCTTCTGC AAAAATGGAT ATTATTAGAA ATTAGAAAAA 288 9 

AATTACTAAT TTTACACATT AGATTTTATT TTACTATTGG AATCTGATAT ACTGTGTGCT 2949 

TGTTTTATAA AATTTTGCTT TTAATTAAAT AAAAGCTGGA AGCAAAGTAT AACCATATGA 3 009 

TACTATCATA CTACTGAAAC AGATTTCATA CCTCAGAATG TAAAAGAACT TACTGATTAT 3 069 

TTTCTTCATC CAACTTATGT TTTTAAATGA GGATTATTGA TAGT 3113 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 81 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

Met Ser Arg Leu Leu Lys Lys Tyr Asp Val Leu Phe Ala Leu Phe Ser 
1 5 10 15 

Lys Leu Glu Arg Thr Cys Glu Leu He Tyr Leu Thr Gin Pro Ser Ser 
20 25 30 

Ser He Ser Thr Glu He Asn Ser Ala Leu Val Leu Lys Val Ser Trp 
35 40 45 

He Thr Phe Leu Leu Ala Lys Gly Glu Val Leu Gin Met Glu Asp Asp 
50 55 60 

Leu Val He Ser Phe Gin Leu Met Leu Cys Val Leu Asp Tyr Phe He 
65 70 75 80 

Lys Leu Ser Pro Pro Met Leu Leu Lys Glu Pro Tyr Lys Thr Ala Val 
85 90 95 

He Pro He Asn Gly Ser Pro Arg Thr Pro Arg Arg Gly Gin Asn Arg 
100 105 110 

Ser Ala Arg He Ala Lys Gin Leu Glu Asn Asp Thr Arg He He Glu 
115 120 125 

Val Leu Cys Lys Glu His Glu Cys Asn He Asp Glu Val Lys Asn Val 
!30 135 140 

Tyr Phe Lys Asn Phe He Pro Phe Met Asn Ser Leu Gly Leu Val Thr 
145 150 155 160 
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Ser Asri Gly Leu Pro Glu Val Glu Asn Leu Ser Lys Arg Tyr Glu Glu 
165 170 175 



lie Tyr Leu Lys Asn Lys Asp Leu Asp Ala Arg Leu Phe Leu Asp His 
180 185 190 

Asp Lys Thr Leu Gin Thr Asp Ser lie Asp Ser Phe Glu Thr Gin Arg 
195 200 205 

Thr Pro Arg Lys Ser Asn Leu Asp Glu Glu Val Asn Val He Pro Pro 
210 215 220 

His Thr Pro Val Arg Thr Val Met Asn Thr He Gin Gin Leu Met Met 
225 230 235 240 

He Leu Asn Ser Ala Ser Asp Gin Pro Ser Glu Asn Leu He Ser Tyr 
245 250 255 

Phe Asn Asn Cys Thr Val Asn Pro Lys Glu Ser He Leu Lys Arg Val 
260 265 270 

Lys Asp He Gly Tyr He Phe Lys Glu Lys Phe Ala Lys Ala Val Gly 
275 280 285 

Gin Gly Cys Val Glu He Gly Ser Gin Arg Tyr Lys Leu Gly Val Arg 
290 295 300 

Leu Tyr Tyr Arg Val Met Glu Ser Met Leu Lys Ser Glu Glu Glu Arg 
305 310 315 320 

Leu Ser He Gin Asn Phe Ser Lys Leu Leu Asn Asp Asn He Phe His 
325 330 335 

Met Ser Leu Leu Ala Cys Ala Leu Glu Val Val Met Ala Thr Tyr Ser 
340 345 350 

Arg Ser Thr Ser Gin Asn Leu Asp Ser Gly Thr Asp Leu Ser Phe Pro 
355 360 365 

Trp He Leu Asn Val Leu Asn Leu Lys Ala Phe Asp Phe Tyr Lys Val 
370 375 380 

He Glu Ser Phe He Lys Ala Glu Gly Asn Leu Thr Arg Glu Met He 
385 390 395 400 

Lys His Leu Glu Arg Cys Glu His Arg He Met Glu Ser Leu Ala Trp 
405 410 415 

Leu Ser Asp Ser Pro Leu Phe Asp Leu He Lys Gin Ser Lys Asp Arg 
420 425 430 

Glu Gly Pro Thr Asp His Leu Glu Ser Ala Cys Pro Leu Asn Leu Pro 
435 440 445 
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Leu Gin Asn Asn His Thr Ala Ala Asp Met Tyr Leu Ser Pro Val Arg 
450 455 460 



Ser Pro Lys Lys Lys Gly Ser Thr Thr Arg Val Asn Ser Thr Ala Asn 
465 470 475 480 

Ala Glu Thr Gin Ala Thr Ser Ala Phe Gin Thr Gin Lys Pro Leu Lys 
485 490 495 

Ser Thr Ser Leu Ser Leu Phe Tyr Lys Lys Val Tyr Arg Leu Ala Tyr 
500 505 510 

Leu Arg Leu Asn Thr Leu Cys Glu Arg Leu Leu Ser Glu His Pro Glu 
515 520 525 

Leu Glu His He He Trp Thr Leu Phe Gin His Thr Leu Gin Asn Glu 
530 535 540 

Tyr Glu Leu Met Arg Asp Arg His Leu Asp Gin He Met Met Cys Ser 
545 550 555 560 

Met Tyr Gly He Cys Lys Val Lys Asn He Asp Leu Lys Phe Lys He 
565 570 575 

He Val Thr Ala Tyr Lys Asp Leu Pro His Ala Val Gin Glu Thr Phe 
580 585 590 

Lys Arg Val Leu He Lys Glu Glu Glu Tyr Asp Ser He He Val Phe 
595 600 605 

Tyr Asn Ser Val Phe Met Gin Arg Leu Lys Thr Asn He Leu Gin Tyr 
610 615 620 

Ala Ser Thr Arg Pro Pro Thr Leu Ser Pro He Pro His He Pro Arg 
625 630 635 640 

Ser Pro Tyr Lys Phe Pro Ser Ser Pro Leu Arg He Pro Gly Gly Asn 
645 650 655 

He Tyr He Ser Pro Leu Lys Ser Pro Tyr Lys He Ser Glu Gly Leu 
660 665 670 

Pro Thr Pro Thr Lys Met Thr Pro Arg Ser Arg He Leu Val Ser He 
675 680 685 

Gly Glu Ser Phe Gly Thr Ser Glu Lys Phe Gin Lys He Asn Gin Met 
690 695 700 

Val Cys Asn Ser Asp Arg Val Leu Lys Arg Ser Ala Glu Gly Ser Asn 
705 710 715 720 

Pro Pro Lys Pro Leu Lys Lys Leu Arg Phe Asp He Glu Gly Ser Asp 
725 730 735 
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Glu Ala Asp Gly Ser Lys His Leu Pro Gly Glu Ser Lys Phe Gin Gin 
740 745 750 

Lys Leu Ala Glu Met Thr Ser Thr Arg Thr Arg Met Gin Lys Gin Lys 
755 760 765 

Met Asn Asp Ser Met Asp Thr Ser Asn Lys Glu Glu Lys 
770 775 780 



(2) INFORMATION FOR SEQ ID NO:38: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3323 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME / KEY : CDS 

(B) LOCATION: 7. .2559 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:38: 



CGCGTC ATG CCG CCC AAA ACC CCC CGA AAA ACG GCC GCC ACC GCC GCC 48 
Met Pro Pro Lys Thr Pro Arg Lys Thr Ala Ala Thr Ala Ala 
15 10 



GCT GCC GCC GCG GAA CCC CCG GCA CCG CCG CCG CCG CCC CCT CCT GAG 96 
Ala Ala Ala Ala Glu Pro Pro Ala Pro Pro Pro Pro Pro Pro Pro Glu 
15 20 25 30 



GTC GAC CTA GAT GAG ATG TCG TTC ACT TTT ACT GAG CTA CAG AAA AAC 144 
Val Asp Leu Asp Glu Met Ser Phe Thr Phe Thr Glu Leu Gin Lys Asn 
35 40 45 



ATA GAA ATC AGT GTC CAT AAA TTC TTT AAC TTA CTA AAA GAA ATT GAT 192 
lie Glu lie Ser Val His Lys Phe Phe Asn Leu Leu Lys Glu lie Asp 
50 55 60 



ACC AGT ACC AAA GTT GAT AAT GCT ATG TCA AGA CTG TTG AAG AAG TAT 240 
Thr Ser Thr Lys Val Asp Asn Ala Met Ser Arg Leu Leu Lys Lys Tyr 
65 70 75 



GAT GTA TTG TTT GCA CTC TTC AGC AAA TTG GAA AGG ACA TGT GAA CTT 2 88 

Asp Val Leu Phe Ala Leu Phe Ser Lys Leu Glu Arg Thr Cys Glu Leu 
80 85 90 



ATA TAT TTG ACA CAA CCC AGC AGT TCG ATA TCT ACT GAA ATA AAT TCT 336 
lie Tyr Leu Thr Gin Pro Ser Ser Ser lie Ser Thr Glu lie Asn Ser 
95 100 105 110 
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GCA TTG GTG CTA AAA GTT TCT TGG ATC ACA TTT TTA TTA GCT AAA GGG 384 
Ala Leu Val Leu Lys Val Ser Trp lie Thr Phe Leu Leu Ala Lys Gly 
115 120 125 

GAA GTA TTA CAA ATG GAA GAT GAT CTG GTG ATT TCA TTT CAG TTA ATG 432 
Glu Val Leu Gin Met Glu Asp Asp Leu Val lie Ser Phe Gin Leu Met 
130 135 140 

CTA TGT GTC CTT GAC TAT TTT ATT AAA CTC TCA CCT CCC ATG TTG CTC 4 80 

Leu Cys Val Leu Asp Tyr Phe lie Lys Leu Ser Pro Pro Met Leu Leu 
145 150 155 

AAA GAA CCA TAT AAA ACA GCT GTT ATA CCC ATT AAT GGT TCA CCT CGA 528 
Lys Glu Pro Tyr Lys Thr Ala Val lie Pro lie Asn Gly Ser Pro Arg 
160 165 170 

ACA CCC AGG CGA GGT CAG AAC AGG AGT GCA CGG ATA GCA AAA CAA CTA 5 76 

Thr Pro Arg Arg Gly Gin Asn Arg Ser Ala Arg lie Ala Lys Gin Leu 
175 180 185 190 

GAA AAT GAT ACA AGA ATT ATT GAA GTT CTC TGT AAA GAA CAT GAA TGT 624 
Glu Asn Asp Thr Arg lie lie Glu Val Leu Cys Lys Glu His Glu Cys 
195 200 205 

AAT ATA GAT GAG GTG AAA AAT GTT TAT TTC AAA AAT TTT ATA CCT TTT 672 
Asn lie Asp Glu Val Lys Asn Val Tyr Phe Lys Asn Phe lie Pro Phe 
210 215 220 

ATG AAT TCT CTT GGA CTT GTA ACA TCT AAT GGA CTT CCA GAG GTT GAA 72 0 

Met Asn Ser Leu Gly Leu Val Thr Ser Asn Gly Leu Pro Glu Val Glu 
225 230 235 

AAT CTT TCT AAA CGA TAC GAA GAA ATT TAT CTT AAA AAT AAA GAT CTA 768 
Asn Leu Ser Lys Arg Tyr Glu Glu lie Tyr Leu Lys Asn Lys Asp Leu 
240 245 250 

GAT GCA AGA TTA TTT TTG GAT CAT GAT AAA ACT CTT CAG ACT GAT TCT 816 
Asp Ala Arg Leu Phe Leu Asp His Asp Lys Thr Leu Gin Thr Asp Ser 
255 260 265 270 

ATA GAC AGT TTT GAA ACA CAG AGA ACA CCA CGA AAA AGT AAC CTT GAT 864 

lie Asp Ser Phe Glu Thr Gin Arg Thr Pro Arg Lys Ser Asn Leu Asp 
275 280 285 

GAA GAG GTG AAT GTA ATT CCT CCA CAC ACT CCA GTT AGG ACT GTT ATG 912 
Glu Glu Val Asn Val lie Pro Pro His Thr Pro Val Arg Thr Val Met 
290 295 300 

AAC ACT ATC CAA CAA TTA ATG ATG ATT TTA AAT TCA GCA AGT GAT CAA 960 
Asn Thr lie Gin Gin Leu Met Met lie Leu Asn Ser Ala Ser Asp Gin 
305 310 315 



-191- 



CCT TCA GAA AAT CTG ATT TCC TAT TTT AAC AAC TGC ACA GTG AAT CCA 1008 
Pro Ser Glu Asn Leu lie Ser Tyr Phe Asn Asn Cys Thr Val Asn Pro 
320 325 330 

AAA GAA AGT ATA CTG AAA AGA GTG AAG GAT ATA GGA TAC ATC TTT AAA 1056 
Lys Glu Ser lie Leu Lys Arg Val Lys Asp lie Gly Tyr lie Phe Lys 
335 340 345 350 

GAG AAA TTT GCT AAA GCT GTG GGA CAG GGT TGT GTC GAA ATT GGA TCA 1104 
Glu Lys Phe Ala Lys Ala Val Gly Gin Gly Cys Val Glu He Gly Ser 
355 360 365 

CAG CGA TAC AAA CTT GGA GTT CGC TTG TAT TAC CGA GTA ATG GAA TCC 1152 
Gin Arg Tyr Lys Leu Gly Val Arg Leu Tyr Tyr Arg Val Met Glu Ser 
370 375 380 

ATG CTT AAA TCA GAA GAA GAA CGA TTA TCC ATT CAA AAT TTT AGC AAA 120 0 

Met Leu Lys Ser Glu Glu Glu Arg Leu Ser He Gin Asn Phe Ser Lys 
385 390 395 

CTT CTG AAT GAC AAC ATT TTT CAT ATG TCT TTA TTG GCG TGC GCT CTT 1248 
Leu Leu Asn Asp Asn He Phe His Met Ser Leu Leu Ala Cys Ala Leu 
400 405 410 

GAG GTT GTA ATG GCC ACA TAT AGC AGA AGT ACA TCT CAG AAT CTT GAT 1296 
Glu Val Val Met Ala Thr Tyr Ser Arg Ser Thr Ser Gin Asn Leu Asp 
415 420 425 430 

TCT GGA ACA GAT TTG TCT TTC CCA TGG ATT CTG AAT GTG CTT AAT TTA 1344 
Ser Gly Thr Asp Leu Ser Phe Pro Trp He Leu Asn Val Leu Asn Leu 
435 440 445 

AAA GCC TTT GAT TTT TAC AAA GTG ATC GAA AGT TTT ATC AAA GCA GAA 13 92 

Lys Ala Phe Asp Phe Tyr Lys Val He Glu Ser Phe He Lys Ala Glu 
450 455 460 

GGC AAC TTG ACA AGA GAA ATG ATA AAA CAT TTA GAA CGA TGT GAA CAT 144 0 

Gly Asn Leu Thr Arg Glu Met He Lys His Leu Glu Arg Cys Glu His 
465 470 475 

CGA ATC ATG GAA TCC CTT GCA TGG CTC TCA GAT TCA CCT TTA TTT GAT 1488 
Arg He Met Glu Ser Leu Ala Trp Leu Ser Asp Ser Pro Leu Phe Asp 
480 485 490 

CTT ATT AAA CAA TCA AAG GAC CGA GAA GGA CCA ACT GAT CAC CTT GAA 1536 
Leu He Lys Gin Ser Lys Asp Arg Glu Gly Pro Thr Asp His Leu Glu 
495 500 505 510 

TCT GCT TGT CCT CTT AAT CTT CCT CTC CAG AAT AAT CAC ACT GCA GCA 1584 
Ser Ala Cys Pro Leu Asn Leu Pro Leu Gin Asn Asn His Thr Ala Ala 
515 520 525 
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GAT ATG TAT CTT TCT CCT GTA AGA TCT CCA AAG AAA AAA GGT TCA ACT 1632 
Asp Met Tyr Leu Ser Pro Val Arg Ser Pro Lys Lys Lys Gly Ser Thr 
530 535 540 

ACG CGT GTA AAT TCT ACT GCA AAT GCA GAG ACA CAA GCA ACC TCA GCC 168 0 

Thr Arg Val Asn Ser Thr Ala Asn Ala Glu Thr Gin Ala Thr Ser Ala 
545 550 555 

TTC CAG ACC CAG AAG CCA TTG AAA TCT ACC TCT CTT TCA CTG TTT TAT 172 8 

Phe Gin Thr Gin Lys Pro Leu Lys Ser Thr Ser Leu Ser Leu Phe Tyr 
560 565 570 

AAA AAA GTG TAT CGG CTA GCC TAT CTC CGG CTA AAT ACA CTT TGT GAA 1776 
Lys Lys Val Tyr Arg Leu Ala Tyr Leu Arg Leu Asn Thr Leu Cys Glu 
575 580 585 590 

CGC CTT CTG TCT GAG CAC CCA GAA TTA GAA CAT ATC ATC TGG ACC CTT 1824 
Arg Leu Leu Ser Glu His Pro Glu Leu Glu His He He Trp Thr Leu 
595 600 605 

TTC CAG CAC ACC CTG CAG AAT GAG TAT GAA CTC ATG AGA GAC AGG CAT 18 72 

Phe Gin His Thr Leu Gin Asn Glu Tyr Glu Leu Met Arg Asp Arg His 
610 615 620 

TTG GAC CAA ATT ATG ATG TGT TCC ATG TAT GGC ATA TGC AAA GTG AAG 1920 
Leu Asp Gin He Met Met Cys Ser Met Tyr Gly He Cys Lys Val Lys 
625 630 635 

AAT ATA GAC CTT AAA TTC AAA ATC ATT GTA ACA GCA TAC AAG GAT CTT 1968 
Asn He Asp Leu Lys Phe Lys He He Val Thr Ala Tyr Lys Asp Leu 
640 645 650 

CCT CAT GCT GTT CAG GAG ACA TTC AAA CGT GTT TTG ATC AAA GAA GAG 2016 
Pro His Ala Val Gin Glu Thr Phe Lys Arg Val Leu He Lys Glu Glu 
655 660 665 670 

GAG TAT GAT TCT ATT ATA GTA TTC TAT AAC TCG GTC TTC ATG CAG AGA 2064 
Glu Tyr Asp Ser He He Val Phe Tyr Asn Ser Val Phe Met Gin Arg 
675 680 685 

CTG AAA ACA AAT ATT TTG CAG TAT GCT TCC ACC AGG CCC CCT ACC TTG 2112 
Leu Lys Thr Asn He Leu Gin Tyr Ala Ser Thr Arg Pro Pro Thr Leu 
690 695 700 

TCA CCA ATA CCT CAC ATT CCT CGA AGC CCT TAC AAG TTT CCT AGT TCA 2160 
Ser Pro He Pro His He Pro Arg Ser Pro Tyr Lys Phe Pro Ser Ser 
705 710 715 

CCC TTA CGG ATT CCT GGA GGG AAC ATC TAT ATT TCA CCC CTG AAG AGT 22 08 

Pro Leu Arg He Pro Gly Gly Asn He Tyr He Ser Pro Leu Lys Ser 
720 725 730 
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CCA TAT AAA ATT TCA GAA GGT CTG CCA ACA CCA ACA AAA ATG ACT CCA 22 56 

Pro Tyr Lys He Ser Glu Gly Leu Pro Thr Pro Thr Lys Met Thr Pro 
735 740 745 750 

AGA TCA AGA ATC TTA GTA TCA ATT GGT GAA TCA TTC GGG ACT TCT GAG 23 04 

Arg Ser Arg He Leu Val Ser He Gly Glu Ser Phe Gly Thr Ser Glu 
755 760 765 

AAG TTC CAG AAA ATA AAT CAG ATG GTA TGT AAC AGC GAC CGT GTG CTC 2352 
Lys Phe Gin Lys He Asn Gin Met Val Cys Asn Ser Asp Arg Val Leu 
770 775 780 

AAA AGA AGT GCT GAA GGA AGC AAC CCT CCT AAA CCA CTG AAA AAA CTA 240 0 

Lys Arg Ser Ala Glu Gly Ser Asn Pro Pro Lys Pro Leu Lys Lys Leu 
785 790 795 

CGC TTT GAT ATT GAA GGA TCA GAT GAA GCA GAT GGA AGT AAA CAT CTC 2448 
Arg Phe Asp He Glu Gly Ser Asp Glu Ala Asp Gly Ser Lys His Leu 
800 805 810 

CCA GGA GAG TCC AAA TTT CAG CAG AAA CTG GCA GAA ATG ACT TCT ACT 2496 
Pro Gly Glu Ser Lys Phe Gin Gin Lys Leu Ala Glu Met Thr Ser Thr 
815 820 825 830 

CGA ACA CGA ATG CAA AAG CAG AAA ATG AAT GAT AGC ATG GAT ACC TCA 2 544 

Arg Thr Arg Met Gin Lys Gin Lys Met Asn Asp Ser Met Asp Thr Ser 
835 840 845 

AAC AAG GAA GAG AAA TGAGGATCTC AGGACCTTGG TGGACACTGT GTACACCTCT 2599 
Asn Lys Glu Glu Lys 
850 

GGATTCATTG TCTCTCACAG ATGTGACTGT ATAACTTTCC CAGGTTCTGT TTATGGCCAC 265 9 

ATTTAATATC TTCAGCTCTT TTTGTGGATA TAAAATGTGC AGATGCAATT GTTTGGGTGA 2 719 

TTCCTAAGCC ACTTGAAATG TTAGTCATTG TTATTTATAC AAGATTGAAA ATCTTGTGTA 2779 

AATCCTGCCA TTTAAAAAGT TGTAGCAGAT TGTTTCCTCT TCCAAAGTAA AATTGCTGTG 283 9 

CTTTATGGAT AGTAAGAATG GCCCTAGAGT GGGAGTCCTG AT AAC C C AGG CCTGTCTGAC 2899 

TACTTTGCCT TCTTTTGTAG CATATAGGTG ATGTTTGCTC TTGTTTTTAT TAATTTATAT 2959 

GTATATTTTT TTAATTTAAC ATGAACACCC TTAGAAAATG TGTCCTATCT ATCTTCCAAA 3 019 

TGCAATTTGA TTGACTGCCC ATTC AC C AAA ATTATCCTGA ACTCTTCTGC AAAAATGGAT 3 079 

ATTATTAGAA ATTAGAAAAA AATTACTAAT TTTACACATT AGATTTTATT TTACTATTGG 313 9 

AATCTGATAT ACTGTGTGCT TGTTTTATAA AATTTTGCTT TTAATTAAAT AAAAGCTGGA 3199 

AGCAAAGTAT AAC C ATATG A TACTATCATA CTACTGAAAC AGATTTCATA CCTCAGAATG 3259 
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TAAAAGAACT TACTGATTAT TTTCTTCATC CAACTTATGT TTTTAAATGA GGATTATTGA 3319 
TAGT 3323 



(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 851 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE : protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

Met Pro Pro Lys Thr Pro Arg Lys Thr Ala Ala Thr Ala Ala Ala Ala 
15 10 15 

Ala Ala Glu Pro Pro Ala Pro Pro Pro Pro Pro Pro Pro Glu Val Asp 
20 25 30 

Leu Asp Glu Met Ser Phe Thr Phe Thr Glu Leu Gin Lys Asn He Glu 
35 40 45 

He Ser Val His Lys Phe Phe Asn Leu Leu Lys Glu He Asp Thr Ser 
50 55 60 

Thr Lys Val Asp Asn Ala Met Ser Arg Leu Leu Lys Lys Tyr Asp Val 
65 70 75 80 

Leu Phe Ala Leu Phe Ser Lys Leu Glu Arg Thr Cys Glu Leu He Tyr 
85 90 95 

Leu Thr Gin Pro Ser Ser Ser He Ser Thr Glu He Asn Ser Ala Leu 
100 105 110 

Val Leu Lys Val Ser Trp He Thr Phe Leu Leu Ala Lys Gly Glu Val 
115 120 125 

Leu Gin Met Glu Asp Asp Leu Val He Ser Phe Gin Leu Met Leu Cys 
130 135 140 

Val Leu Asp Tyr Phe He Lys Leu Ser Pro Pro Met Leu Leu Lys Glu 
145 150 155 160 

Pro Tyr Lys Thr Ala Val He Pro He Asn Gly Ser Pro Arg Thr Pro 
165 170 175 

Arg Arg Gly Gin Asn Arg Ser Ala Arg He Ala Lys Gin Leu Glu Asn 
180 185 190 

Asp Thr Arg He He Glu Val Leu Cys Lys Glu His Glu Cys Asn He 
195 200 205 
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Asp Glu Val Lys Asn Val Tyr Phe Lys Asn Phe lie Pro Phe Met Asn 
210 215 220 



Ser Leu Gly Leu Val Thr Ser Asn Gly Leu Pro Glu Val Glu Asn Leu 
225 230 235 240 

Ser Lys Arg Tyr Glu Glu lie Tyr Leu Lys Asn Lys Asp Leu Asp Ala 
245 250 255 

Arg Leu Phe Leu Asp His Asp Lys Thr Leu Gin Thr Asp Ser lie Asp 
260 265 270 

Ser Phe Glu Thr Gin Arg Thr Pro Arg Lys Ser Asn Leu Asp Glu Glu 
275 280 285 

Val Asn Val lie Pro Pro His Thr Pro Val Arg Thr Val Met Asn Thr 
290 295 300 

lie Gin Gin Leu Met Met lie Leu Asn Ser Ala Ser Asp Gin Pro Ser 
305 310 315 320 

Glu Asn Leu lie Ser Tyr Phe Asn Asn Cys Thr Val Asn Pro Lys Glu 
325 330 335 

Ser lie Leu Lys Arg Val Lys Asp lie Gly Tyr lie Phe Lys Glu Lys 
340 345 350 

Phe Ala Lys Ala Val Gly Gin Gly Cys Val Glu lie Gly Ser Gin Arg 
355 360 365 

Tyr Lys Leu Gly Val Arg Leu Tyr Tyr Arg Val Met Glu Ser Met Leu 
370 375 380 

Lys Ser Glu Glu Glu Arg Leu Ser lie Gin Asn Phe Ser Lys Leu Leu 
385 390 395 400 

Asn Asp Asn He Phe His Met Ser Leu Leu Ala Cys Ala Leu Glu Val 
405 410 415 

Val Met Ala Thr Tyr Ser Arg Ser Thr Ser Gin Asn Leu Asp Ser Gly 
420 425 430 

Thr Asp Leu Ser Phe Pro Trp He Leu Asn Val Leu Asn Leu Lys Ala 
435 440 445 

Phe Asp Phe Tyr Lys Val He Glu Ser Phe He Lys Ala Glu Gly Asn 
450 455 460 

Leu Thr Arg Glu Met He Lys His Leu Glu Arg Cys Glu His Arg He 
465 470 475 480 

Met Glu Ser Leu Ala Trp Leu Ser Asp Ser Pro Leu Phe Asp Leu He 
485 490 495 
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Lys Gin Ser Lys 
500 

Cys Pro Leu Asn 
515 



Asp Arg Glu Gly 



Leu Pro Leu Gin 
520 



Pro Thr Asp His 
505 

Asn Asn His Thr 



Leu Glu Ser Ala 
510 

Ala Ala Asp Met 
525 



Tyr Leu Ser Pro Val 
530 

Val Asn Ser Thr Ala 
545 

Thr Gin Lys Pro Leu 
565 

Val Tyr Arg Leu Ala 
580 

Leu Ser Glu His Pro 
595 

His Thr Leu Gin Asn 
610 

Gin lie Met Met Cys 
625 

Asp Leu Lys Phe Lys 
645 

Ala Val Gin Glu Thr 
660 

Asp Ser lie lie Val 
675 

Thr Asn lie Leu Gin 
690 

lie Pro His lie Pro 
705 

Arg lie Pro Gly Gly 
725 

Lys lie Ser Glu Gly 
740 

Arg lie Leu Val Ser 
755 

Gin Lys lie Asn Gin 
770 



Arg Ser Pro Lys Lys Lys 
535 

Asn Ala Glu Thr Gin Ala 
550 555 

Lys Ser Thr Ser Leu Ser 
570 

Tyr Leu Arg Leu Asn Thr 
585 

Glu Leu Glu His He He 
600 

Glu Tyr Glu Leu Met Arg 
615 

Ser Met Tyr Gly He Cys 
630 635 

He He Val Thr Ala Tyr 
650 

Phe Lys Arg Val Leu He 
665 

Phe Tyr Asn Ser Val Phe 
680 

Tyr Ala Ser Thr Arg Pro 
695 

Arg Ser Pro Tyr Lys Phe 
710 715 

Asn He Tyr He Ser Pro 
730 

Leu Pro Thr Pro Thr Lys 
745 

He Gly Glu Ser Phe Gly 
760 

Met Val Cys Asn Ser Asp 
775 



Gly Ser Thr Thr Arg 
540 

Thr Ser Ala Phe Gin 
560 

Leu Phe Tyr Lys Lys 
575 

Leu Cys Glu Arg Leu 
590 

Trp Thr Leu Phe Gin 
605 

Asp Arg His Leu Asp 
620 

Lys Val Lys Asn He 
640 

Lys Asp Leu Pro His 
655 

Lys Glu Glu Glu Tyr 
670 

Met Gin Arg Leu Lys 
685 

Pro Thr Leu Ser Pro 
700 

Pro Ser Ser Pro Leu 
720 

Leu Lys Ser Pro Tyr 
735 

Met Thr Pro Arg Ser 
750 

Thr Ser Glu Lys Phe 
765 

Arg Val Leu Lys Arg 
780 
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Ser Ala Glu Gly 
785 

Asp lie Glu Gly 



Glu Ser Lys Phe 
820 

Arg Met Gin Lys 
835 

Glu Glu Lys 
850 



Ser Asn Pro Pro 
790 

Ser Asp Glu Ala 
805 

Gin Gin Lys Leu 



Gin Lys Met Asn 
840 



Lys Pro Leu Lys 
795 

Asp Gly Ser Lys 
810 

Ala Glu Met Thr 
825 

Asp Ser Met Asp 



Lys Leu Arg Phe 
800 

His Leu Pro Gly 
815 

Ser Thr Arg Thr 
830 

Thr Ser Asn Lys 
845 



(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3461 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 7.. 2 697 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

CGCGTC ATG CCG CCC AAA ACC CCC CGA AAA ACG GCC GCC ACC GCC GCC 4 8 

Met Pro Pro Lys Thr Pro Arg Lys Thr Ala Ala Thr Ala Ala 
15 10 

GCT GCC GCC GCG GAA CCC CCG GCA CCG CCG CCG CCG CCC CCT CCT GAG 96 
Ala Ala Ala Ala Glu Pro Pro Ala Pro Pro Pro Pro Pro Pro Pro Glu 
15 20 25 30 

GAG GAC CCA GAG CAG GAC AGC GGC CCG GAG GAC CTG CCT CTC GTC AGG 144 
Glu Asp Pro Glu Gin Asp Ser Gly Pro Glu Asp Leu Pro Leu Val Arg 
35 40 45 

CTT GAG TTT GAA GAA ACA GAA GAA CCT GAT TTT ACT GCA TTA TGT CAG 192 
Leu Glu Phe Glu Glu Thr Glu Glu Pro Asp Phe Thr Ala Leu Cys Gin 
50 55 60 

AAA TTA AAG ATA CCA GAT CAT GTC AGA GAG AGA GCT TGG TTG GTC GAC 240 
Lys Leu Lys lie Pro Asp His Val Arg Glu Arg Ala Trp Leu Val Asp 
65 70 75 

CTA GAT GAG ATG TCG TTC ACT TTT ACT GAG CTA CAG AAA AAC ATA GAA 288 
Leu Asp Glu Met Ser Phe Thr Phe Thr Glu Leu Gin Lys Asn lie Glu 
80 85 90 
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ATC AGT GTC CAT AAA TTC TTT AAC TTA CTA AAA GAA ATT GAT ACC AGT 336 
lie Ser Val His Lys Phe Phe Asn Leu Leu Lys Glu lie Asp Thr Ser 
95 100 105 110 

ACC AAA GTT GAT AAT GCT ATG TCA AGA CTG TTG AAG AAG TAT GAT GTA 3 84 

Thr Lys Val Asp Asn Ala Met Ser Arg Leu Leu Lys Lys Tyr Asp Val 
115 120 125 

TTG TTT GCA CTC TTC AGC AAA TTG GAA AGG ACA TGT GAA CTT ATA TAT 432 
Leu Phe Ala Leu Phe Ser Lys Leu Glu Arg Thr Cys Glu Leu lie Tyr 
130 135 140 

TTG ACA CAA CCC AGC AGT TCG ATA TCT ACT GAA ATA AAT TCT GCA TTG 480 
Leu Thr Gin Pro Ser Ser Ser lie Ser Thr Glu lie Asn Ser Ala Leu 
145 150 155 

GTG CTA AAA GTT TCT TGG ATC ACA TTT TTA TTA GCT AAA GGG GAA GTA 52 8 

Val Leu Lys Val Ser Trp lie Thr Phe Leu Leu Ala Lys Gly Glu Val 
160 165 170 

TTA CAA ATG GAA GAT GAT CTG GTG ATT TCA TTT CAG TTA ATG CTA TGT 576 
Leu Gin Met Glu Asp Asp Leu Val lie Ser Phe Gin Leu Met Leu Cys 
175 180 185 190 

GTC CTT GAC TAT TTT ATT AAA CTC TCA CCT CCC ATG TTG CTC AAA GAA 624 
Val Leu Asp Tyr Phe lie Lys Leu Ser Pro Pro Met Leu Leu Lys Glu 
195 200 205 

CCA TAT AAA ACA GCT GTT ATA CCC ATT AAT GGT TCA CCT CGA ACA CCC 672 
Pro Tyr Lys Thr Ala Val lie Pro lie Asn Gly Ser Pro Arg Thr Pro 
210 215 220 

AGG CGA GGT CAG AAC AGG AGT GCA CGG ATA GCA AAA CAA CTA GAA AAT 72 0 

Arg Arg Gly Gin Asn Arg Ser Ala Arg lie Ala Lys Gin Leu Glu Asn 
225 230 235 

GAT ACA AGA ATT ATT GAA GTT CTC TGT AAA GAA CAT GAA TGT AAT ATA 768 
Asp Thr Arg lie lie Glu Val Leu Cys Lys Glu His Glu Cys Asn lie 
240 245 250 

GAT GAG GTG AAA AAT GTT TAT TTC AAA AAT TTT ATA CCT TTT ATG AAT 816 
Asp Glu Val Lys Asn Val Tyr Phe Lys Asn Phe lie Pro Phe Met Asn 
255 260 265 270 

TCT CTT GGA CTT GTA ACA TCT AAT GGA CTT CCA GAG GTT GAA AAT CTT 864 
Ser Leu Gly Leu Val Thr Ser Asn Gly Leu Pro Glu Val Glu Asn Leu 
275 280 285 

TCT AAA CGA TAC GAA GAA ATT TAT CTT AAA AAT AAA GAT CTA GAT GCA 912 
Ser Lys Arg Tyr Glu Glu lie Tyr Leu Lys Asn Lys Asp Leu Asp Ala 
290 295 300 
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AGA TTA TTT TTG GAT CAT GAT AAA ACT CTT CAG ACT GAT TCT ATA GAC 960 
Arg Leu Phe Leu Asp His Asp Lys Thr Leu Gin Thr Asp Ser lie Asp 
305 310 315 

AGT TTT GAA ACA CAG AGA ACA CCA CGA AAA AGT AAC CTT GAT GAA GAG 1008 
Ser Phe Glu Thr Gin Arg Thr Pro Arg Lys Ser Asn Leu Asp Glu Glu 
320 325 330 

GTG AAT GTA ATT CCT CCA CAC ACT CCA GTT AGG ACT GTT ATG AAC ACT 1056 
Val Asn Val lie Pro Pro His Thr Pro Val Arg Thr Val Met Asn Thr 
335 340 345 350 

ATC CAA CAA TTA ATG ATG ATT TTA AAT TCA GCA AGT GAT CAA CCT TCA 1104 
lie Gin Gin Leu Met Met lie Leu Asn Ser Ala Ser Asp Gin Pro Ser 
355 360 365 

GAA AAT CTG ATT TCC TAT TTT AAC AAC TGC ACA GTG AAT CCA AAA GAA 1152 
Glu Asn Leu lie Ser Tyr Phe Asn Asn Cys Thr Val Asn Pro Lys Glu 
370 375 380 

AGT ATA CTG AAA AGA GTG AAG GAT ATA GGA TAC ATC TTT AAA GAG AAA 120 0 

Ser lie Leu Lys Arg Val Lys Asp lie Gly Tyr lie Phe Lys Glu Lys 
385 390 395 

TTT GCT AAA GCT GTG GGA CAG GGT TGT GTC GAA ATT GGA TCA CAG CGA 1248 
Phe Ala Lys Ala Val Gly Gin Gly Cys Val Glu He Gly Ser Gin Arg 
400 405 410 

TAC AAA CTT GGA GTT CGC TTG TAT TAC CGA GTA ATG GAA TCC ATG CTT 1296 
Tyr Lys Leu Gly Val Arg Leu Tyr Tyr Arg Val Met Glu Ser Met Leu 
415 420 425 430 

AAA TCA GAA GAA GAA CGA TTA TCC ATT CAA AAT TTT AGC AAA CTT CTG 1344 
Lys Ser Glu Glu Glu Arg Leu Ser He Gin Asn Phe Ser Lys Leu Leu 
435 440 445 

AAT GAC AAC ATT TTT CAT ATG TCT TTA TTG GCG TGC GCT CTT GAG GTT 13 92 

Asn Asp Asn He Phe His Met Ser Leu Leu Ala Cys Ala Leu Glu Val 
450 455 460 

GTA ATG GCC ACA TAT AGC AGA AGT ACA TCT CAG AAT CTT GAT TCT GGA 144 0 

Val Met Ala Thr Tyr Ser Arg Ser Thr Ser Gin Asn Leu Asp Ser Gly 
465 470 475 

ACA GAT TTG TCT TTC CCA TGG ATT CTG AAT GTG CTT AAT TTA AAA GCC 1488 
Thr Asp Leu Ser Phe Pro Trp He Leu Asn Val Leu Asn Leu Lys Ala 
480 485 490 

TTT GAT TTT TAC AAA GTG ATC GAA AGT TTT ATC AAA GCA GAA GGC AAC 1536 
Phe Asp Phe Tyr Lys Val He Glu Ser Phe He Lys Ala Glu Gly Asn 
495 500 505 510 
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TTG ACA AGA GAA ATG ATA AAA CAT TTA GAA CGA TGT GAA CAT CGA ATC 1584 

Leu Thr Arg Glu Met lie Lys His Leu Glu Arg Cys Glu His Arg lie 

515 520 525 

ATG GAA TCC CTT GCA TGG CTC TCA GAT TCA CCT TTA TTT GAT CTT ATT 1632 

Met Glu Ser Leu Ala Trp Leu Ser Asp Ser Pro Leu Phe Asp Leu lie 

530 535 540 

AAA CAA TCA AAG GAC CGA GAA GGA CCA ACT GAT CAC CTT GAA TCT GCT 1680 

Lys Gin Ser Lys Asp Arg Glu Gly Pro Thr Asp His Leu Glu Ser Ala 
545 550 555 

TGT CCT CTT AAT CTT CCT CTC CAG AAT AAT CAC ACT GCA GCA GAT ATG 1728 

Cys Pro Leu Asn Leu Pro Leu Gin Asn Asn His Thr Ala Ala Asp Met 
560 565 570 

TAT CTT TCT CCT GTA AGA TCT CCA AAG AAA AAA GGT TCA ACT ACG CGT 1776 

Tyr Leu Ser Pro Val Arg Ser Pro Lys Lys Lys Gly Ser Thr Thr Arg 

575 580 585 590 

GTA AAT TCT ACT GCA AAT GCA GAG ACA CAA GCA ACC TCA GCC TTC CAG 1824 

Val Asn Ser Thr Ala Asn Ala Glu Thr Gin Ala Thr Ser Ala Phe Gin 

595 600 605 

ACC CAG AAG CCA TTG AAA TCT ACC TCT CTT TCA CTG TTT TAT AAA AAA 1872 

Thr Gin Lys Pro Leu Lys Ser Thr Ser Leu Ser Leu Phe Tyr Lys Lys 

610 615 620 

GTG TAT CGG CTA GCC TAT CTC CGG CTA AAT ACA CTT TGT GAA CGC CTT 192 0 

Val Tyr Arg Leu Ala Tyr Leu Arg Leu Asn Thr Leu Cys Glu Arg Leu 
625 630 635 

CTG TCT GAG CAC CCA GAA TTA GAA CAT ATC ATC TGG ACC CTT TTC CAG 1968 

Leu Ser Glu His Pro Glu Leu Glu His lie lie Trp Thr Leu Phe Gin 
640 645 650 

CAC ACC CTG CAG AAT GAG TAT GAA CTC ATG AGA GAC AGG CAT TTG GAC 2 016 

His Thr Leu Gin Asn Glu Tyr Glu Leu Met Arg Asp Arg His Leu Asp 

655 660 665 670 

CAA ATT ATG ATG TGT TCC ATG TAT GGC ATA TGC AAA GTG AAG AAT ATA 2 064 

Gin lie Met Met Cys Ser Met Tyr Gly lie Cys Lys Val Lys Asn lie 

675 680 685 

GAC CTT AAA TTC AAA ATC ATT GTA ACA GCA TAC AAG GAT CTT CCT CAT 2112 

Asp Leu Lys Phe Lys lie lie Val Thr Ala Tyr Lys Asp Leu Pro His 

690 695 700 

GCT GTT CAG GAG ACA TTC AAA CGT GTT TTG ATC AAA GAA GAG GAG TAT 2160 

Ala Val Gin Glu Thr Phe Lys Arg Val Leu lie Lys Glu Glu Glu Tyr 
705 710 715 
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GAT TCT ATT ATA GTA TTC TAT AAC TCG GTC TTC ATG CAG AGA CTG AAA 22 08 

Asp Ser lie lie Val Phe Tyr Asn Ser Val Phe Met Gin Arg Leu Lys 
720 725 730 

ACA AAT ATT TTG CAG TAT GCT TCC ACC AGG CCC CCT ACC TTG TCA CCA 22 56 

Thr Asn lie Leu Gin Tyr Ala Ser Thr Arg Pro Pro Thr Leu Ser Pro 
735 740 745 750 

ATA CCT CAC ATT CCT CGA AGC CCT TAC AAG TTT CCT AGT TCA CCC TTA 2 3 04 

lie Pro His lie Pro Arg Ser Pro Tyr Lys Phe Pro Ser Ser Pro Leu 
755 760 765 

CGG ATT CCT GGA GGG AAC ATC TAT ATT TCA CCC CTG AAG AGT CCA TAT 2352 
Arg lie Pro Gly Gly Asn lie Tyr lie Ser Pro Leu Lys Ser Pro Tyr 
770 775 780 

AAA ATT TCA GAA GGT CTG CCA ACA CCA ACA AAA ATG ACT CCA AGA TCA 240 0 

Lys lie Ser Glu Gly Leu Pro Thr Pro Thr Lys Met Thr Pro Arg Ser 
785 790 795 

AGA ATC TTA GTA TCA ATT GGT GAA TCA TTC GGG ACT TCT GAG AAG TTC 2448 
Arg lie Leu Val Ser lie Gly Glu Ser Phe Gly Thr Ser Glu Lys Phe 
800 805 810 

CAG AAA ATA AAT CAG ATG GTA TGT AAC AGC GAC CGT GTG CTC AAA AGA 24 96 

Gin Lys lie Asn Gin Met Val Cys Asn Ser Asp Arg Val Leu Lys Arg 
815 820 825 830 

AGT GCT GAA GGA AGC AAC CCT CCT AAA CCA CTG AAA AAA CTA CGC TTT 2544 
Ser Ala Glu Gly Ser Asn Pro Pro Lys Pro Leu Lys Lys Leu Arg Phe 
835 840 845 

GAT ATT GAA GGA TCA GAT GAA GCA GAT GGA AGT AAA CAT CTC CCA GGA 2 592 

Asp lie Glu Gly Ser Asp Glu Ala Asp Gly Ser Lys His Leu Pro Gly 
850 855 860 

GAG TCC AAA TTT CAG CAG AAA CTG GCA GAA ATG ACT TCT ACT CGA ACA 2640 
Glu Ser Lys Phe Gin Gin Lys Leu Ala Glu Met Thr Ser Thr Arg Thr 
865 870 875 

CGA ATG CAA AAG CAG AAA ATG AAT GAT AGC ATG GAT ACC TCA AAC AAG 2 6 88 

Arg Met Gin Lys Gin Lys Met Asn Asp Ser Met Asp Thr Ser Asn Lys 
880 885 890 

GAA GAG AAA TGAGGATCTC AGGACCTTGG TGGACACTGT GTACACCTCT 2 73 7 

Glu Glu Lys 

895 

GGATTCATTG TCTCTCACAG ATGTGACTGT ATAACTTTCC CAGGTTCTGT TTATGGCCAC 2 797 

ATTTAATATC TTCAGCTCTT TTTGTGGATA TAAAATGTGC AGATGCAATT GTTTGGGTGA 2857 

TTCCTAAGCC ACTTGAAATG TTAGTCATTG TTATTTATAC AAGATTGAAA ATCTTGTGTA 2917 
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AATCCTGCCA TTTAAAAAGT TGTAGCAGAT TGTTTCCTCT TCCAAAGTAA AATTGCTGTG 2 977 

CTTTATGGAT AGTAAGAATG GCCCTAGAGT GGGAGTCCTG ATAACCCAGG CCTGTCTGAC 3037 

TACTTTGCCT TCTTTTGTAG CATATAGGTG ATGTTTGCTC TTGTTTTTAT TAATTTATAT 3097 

GTATATTTTT TTAATTTAAC ATGAACACCC TTAGAAAATG TGTCCTATCT ATCTTCCAAA 315 7 

TGCAATTTGA TTGACTGCCC ATTCACCAAA ATTATCCTGA ACTCTTCTGC AAAAATGGAT 3217 

ATTATTAGAA ATTAGAAAAA AATTACTAAT TTTACACATT AGATTTTATT TTACTATTGG 3277 

AATCTGATAT ACTGTGTGCT TGTTTTATAA AATTTTGCTT TTAATTAAAT AAAAGCTGGA 333 7 

AGCAAAGTAT AACCATATGA TACTATCATA CTACTGAAAC AGATTTCATA CCTCAGAATG 3397 

TAAAAGAACT TACTGATTAT TTTCTTCATC CAACTTATGT TTTTAAATGA GGATTATTGA 3457 

TAGT 34 61 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 897 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

Met Pro Pro Lys Thr Pro Arg Lys Thr Ala Ala Thr Ala Ala Ala Ala 
15 10 15 

Ala Ala Glu Pro Pro Ala Pro Pro Pro Pro Pro Pro Pro Glu Glu Asp 
20 25 30 

Pro Glu Gin Asp Ser Gly Pro Glu Asp Leu Pro Leu Val Arg Leu Glu 
35 40 45 

Phe Glu Glu Thr Glu Glu Pro Asp Phe Thr Ala Leu Cys Gin Lys Leu 
50 55 60 

Lys lie Pro Asp His Val Arg Glu Arg Ala Trp Leu Val Asp Leu Asp 
65 70 75 80 

Glu Met Ser Phe Thr Phe Thr Glu Leu Gin Lys Asn lie Glu lie Ser 
85 90 95 

Val His Lys Phe Phe Asn Leu Leu Lys Glu lie Asp Thr Ser Thr Lys 
100 105 110 
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Val Asp Asn Ala Met Ser Arg Leu Leu Lys Lys Tyr Asp Val Leu Phe 
115 120 125 



Ala Leu Phe Ser Lys Leu Glu Arg Thr Cys Glu Leu lie Tyr Leu Thr 
130 135 140 

Gin Pro Ser Ser Ser lie Ser Thr Glu lie Asn Ser Ala Leu Val Leu 
145 150 155 160 

Lys Val Ser Trp lie Thr Phe Leu Leu Ala Lys Gly Glu Val Leu Gin 
165 170 175 

Met Glu Asp Asp Leu Val lie Ser Phe Gin Leu Met Leu Cys Val Leu 
180 185 190 

Asp Tyr Phe lie Lys Leu Ser Pro Pro Met Leu Leu Lys Glu Pro Tyr 
195 200 205 

Lys Thr Ala Val lie Pro lie Asn Gly Ser Pro Arg Thr Pro Arg Arg 
210 215 220 

Gly Gin Asn Arg Ser Ala Arg lie Ala Lys Gin Leu Glu Asn Asp Thr 
225 230 235 240 

Arg lie lie Glu Val Leu Cys Lys Glu His Glu Cys Asn lie Asp Glu 
245 250 255 

Val Lys Asn Val Tyr Phe Lys Asn Phe lie Pro Phe Met Asn Ser Leu 
260 265 270 

Gly Leu Val Thr Ser Asn Gly Leu Pro Glu Val Glu Asn Leu Ser Lys 
275 280 285 

Arg Tyr Glu Glu lie Tyr Leu Lys Asn Lys Asp Leu Asp Ala Arg Leu 
290 295 300 

Phe Leu Asp His Asp Lys Thr Leu Gin Thr Asp Ser lie Asp Ser Phe 
305 310 315 320 

Glu Thr Gin Arg Thr Pro Arg Lys Ser Asn Leu Asp Glu Glu Val Asn 
325 330 335 

Val lie Pro Pro His Thr Pro Val Arg Thr Val Met Asn Thr lie Gin 
340 345 350 

Gin Leu Met Met He Leu Asn Ser Ala Ser Asp Gin Pro Ser Glu Asn 
355 360 365 

Leu He Ser Tyr Phe Asn Asn Cys Thr Val Asn Pro Lys Glu Ser He 
370 375 380 

Leu Lys Arg Val Lys Asp He Gly Tyr He Phe Lys Glu Lys Phe Ala 
385 390 395 400 
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Lys Ala Val Gly Gin Gly Cys Val Glu lie Gly Ser Gin Arg Tyr Lys 
405 410 415 



Leu Gly Val Arg Leu Tyr Tyr Arg Val Met Glu Ser Met Leu Lys Ser 
420 425 430 

Glu Glu Glu Arg Leu Ser lie Gin Asn Phe Ser Lys Leu Leu Asn Asp 
435 440 445 

Asn lie Phe His Met Ser Leu Leu Ala Cys Ala Leu Glu Val Val Met 
450 455 460 

Ala Thr Tyr Ser Arg Ser Thr Ser Gin Asn Leu Asp Ser Gly Thr Asp 
465 470 475 480 

Leu Ser Phe Pro Trp He Leu Asn Val Leu Asn Leu Lys Ala Phe Asp 
485 490 495 

Phe Tyr Lys Val He Glu Ser Phe He Lys Ala Glu Gly Asn Leu Thr 
500 505 510 

Arg Glu Met He Lys His Leu Glu Arg Cys Glu His Arg He Met Glu 
515 520 525 

Ser Leu Ala Trp Leu Ser Asp Ser Pro Leu Phe Asp Leu He Lys Gin 
530 535 540 

Ser Lys Asp Arg Glu Gly Pro Thr Asp His Leu Glu Ser Ala Cys Pro 
545 550 555 560 

Leu Asn Leu Pro Leu Gin Asn Asn His Thr Ala Ala Asp Met Tyr Leu 
565 570 575 

Ser Pro Val Arg Ser Pro Lys Lys Lys Gly Ser Thr Thr Arg Val Asn 
580 585 590 

Ser Thr Ala Asn Ala Glu Thr Gin Ala Thr Ser Ala Phe Gin Thr Gin 
595 600 605 

Lys Pro Leu Lys Ser Thr Ser Leu Ser Leu Phe Tyr Lys Lys Val Tyr 
610 615 620 

Arg Leu Ala Tyr Leu Arg Leu Asn Thr Leu Cys Glu Arg Leu Leu Ser 
625 630 635 640 

Glu His Pro Glu Leu Glu His He He Trp Thr Leu Phe Gin His Thr 
645 650 655 

Leu Gin Asn Glu Tyr Glu Leu Met Arg Asp Arg His Leu Asp Gin He 
660 665 670 

Met Met Cys Ser Met Tyr Gly He Cys Lys Val Lys Asn He Asp Leu 
675 680 685 
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Lys Phe Lys lie lie Val Thr Ala Tyr Lys Asp Leu Pro His Ala Val 
690 695 700 



Gin Glu Thr Phe Lys Arg Val Leu lie Lys Glu Glu Glu Tyr Asp Ser 
705 710 715 720 

lie lie Val Phe Tyr Asn Ser Val Phe Met Gin Arg Leu Lys Thr Asn 
725 730 735 

lie Leu Gin Tyr Ala Ser Thr Arg Pro Pro Thr Leu Ser Pro lie Pro 
740 745 750 

His lie Pro Arg Ser Pro Tyr Lys Phe Pro Ser Ser Pro Leu Arg He 
755 760 765 

Pro Gly Gly Asn lie Tyr He Ser Pro Leu Lys Ser Pro Tyr Lys He 
770 775 780 

Ser Glu Gly Leu Pro Thr Pro Thr Lys Met Thr Pro Arg Ser Arg He 
785 790 795 800 

Leu Val Ser He Gly Glu Ser Phe Gly Thr Ser Glu Lys Phe Gin Lys 
805 810 815 

He Asn Gin Met Val Cys Asn Ser Asp Arg Val Leu Lys Arg Ser Ala 
820 825 830 

Glu Gly Ser Asn Pro Pro Lys Pro Leu Lys Lys Leu Arg Phe Asp He 
835 840 845 

Glu Gly Ser Asp Glu Ala Asp Gly Ser Lys His Leu Pro Gly Glu Ser 
850 855 860 

Lys Phe Gin Gin Lys Leu Ala Glu Met Thr Ser Thr Arg Thr Arg Met 
865 870 875 880 

Gin Lys Gin Lys Met Asn Asp Ser Met Asp Thr Ser Asn Lys Glu Glu 
885 890 895 

Lys 



(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 34 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 7.. 2583 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:42: 



CGCGTC ATG CCG CCC AAA ACC CCC CGA AAA ACG GCC GCC ACC GCC GCC 48 
Met Pro Pro Lys Thr Pro Arg Lys Thr Ala Ala Thr Ala Ala 
15 10 

GCT GCC GCC GCG GAA CCC CCG GCA CCG CCG CCG CCG CCC CCT CCT GAG 96 
Ala Ala Ala Ala Glu Pro Pro Ala Pro Pro Pro Pro Pro Pro Pro Glu 
15 20 25 30 

GAG GAC CCA GAG CAG GAC AGC GGC CCG GAG GAC CTG CCT CTC GTC AGG 144 
Glu Asp Pro Glu Gin Asp Ser Gly Pro Glu Asp Leu Pro Leu Val Arg 
35 40 45 

CTT GAG TTT GAA GAA ACA GAA GAA CCT GAT TTT ACT GCA TTA TGT CAG 192 
Leu Glu Phe Glu Glu Thr Glu Glu Pro Asp Phe Thr Ala Leu Cys Gin 
50 55 60 

AAA TTA AAG ATA CCA GAT CAT GTC AGA GAG AGA GCT TGG TTA ACT TGG 24 0 

Lys Leu Lys lie Pro Asp His Val Arg Glu Arg Ala Trp Leu Thr Trp 
65 70 75 

GAG AAA GTT TCA TCT GTG GAT GGA GTA TTG GGA GGT TAT ATT CAA AAG 28 8 

Glu Lys Val Ser Ser Val Asp Gly Val Leu Gly Gly Tyr lie Gin Lys 
80 85 90 

AAA AAG GAA CTG TGG GGA ATC TGT ATC TTT ATT GCA GCA GTT GAC CTA 336 
Lys Lys Glu Leu Trp Gly lie Cys lie Phe lie Ala Ala Val Asp Leu 
95 100 105 110 

GTC GAA TCT ACT GAA ATA AAT TCT GCA TTG GTG CTA AAA GTT TCT TGG 384 
Val Glu Ser Thr Glu lie Asn Ser Ala Leu Val Leu Lys Val Ser Trp 
115 120 125 

ATC ACA TTT TTA TTA GCT AAA GGG GAA GTA TTA CAA ATG GAA GAT GAT 432 
lie Thr Phe Leu Leu Ala Lys Gly Glu Val Leu Gin Met Glu Asp Asp 
130 135 140 

CTG GTG ATT TCA TTT CAG TTA ATG CTA TGT GTC CTT GAC TAT TTT ATT 48 0 

Leu Val lie Ser Phe Gin Leu Met Leu Cys Val Leu Asp Tyr Phe lie 
145 150 155 

AAA CTC TCA CCT CCC ATG TTG CTC AAA GAA CCA TAT AAA ACA GCT GTT 52 8 

Lys Leu Ser Pro Pro Met Leu Leu Lys Glu Pro Tyr Lys Thr Ala Val 
160 165 170 

ATA CCC ATT AAT GGT TCA CCT CGA ACA CCC AGG CGA GGT CAG AAC AGG 576 
lie Pro lie Asn Gly Ser Pro Arg Thr Pro Arg Arg Gly Gin Asn Arg 
175 180 185 190 



-207- 



AGT GCA CGG ATA GCA AAA CAA CTA GAA AAT GAT ACA AGA ATT ATT GAA 624 
Ser Ala Arg lie Ala Lys Gin Leu Glu Asn Asp Thr Arg He He Glu 
195 200 205 

GTT CTC TGT AAA GAA CAT GAA TGT AAT ATA GAT GAG GTG AAA AAT GTT 6 72 

Val Leu Cys Lys Glu His Glu Cys Asn He Asp Glu Val Lys Asn Val 
210 215 220 

TAT TTC AAA AAT TTT ATA CCT TTT ATG AAT TCT CTT GGA CTT GTA ACA 72 0 

Tyr Phe Lys Asn Phe He Pro Phe Met Asn Ser Leu Gly Leu Val Thr 
225 230 235 

TCT AAT GGA CTT CCA GAG GTT GAA AAT CTT TCT AAA CGA TAC GAA GAA 768 
Ser Asn Gly Leu Pro Glu Val Glu Asn Leu Ser Lys Arg Tyr Glu Glu 
240 245 250 

ATT TAT CTT AAA AAT AAA GAT CTA GAT GCA AGA TTA TTT TTG GAT CAT 816 
He Tyr Leu Lys Asn Lys Asp Leu Asp Ala Arg Leu Phe Leu Asp His 
255 260 265 270 

GAT AAA ACT CTT CAG ACT GAT TCT ATA GAC AGT TTT GAA ACA CAG AGA 864 
Asp Lys Thr Leu Gin Thr Asp Ser He Asp Ser Phe Glu Thr Gin Arg 
275 280 285 

ACA CCA CGA AAA AGT AAC CTT GAT GAA GAG GTG AAT GTA ATT CCT CCA 912 
Thr Pro Arg Lys Ser Asn Leu Asp Glu Glu Val Asn Val He Pro Pro 
290 295 300 

CAC ACT CCA GTT AGG ACT GTT ATG AAC ACT ATC CAA CAA TTA ATG ATG 96 0 

His Thr Pro Val Arg Thr Val Met Asn Thr He Gin Gin Leu Met Met 
305 310 315 

ATT TTA AAT TCA GCA AGT GAT CAA CCT TCA GAA AAT CTG ATT TCC TAT 1008 
He Leu Asn Ser Ala Ser Asp Gin Pro Ser Glu Asn Leu He Ser Tyr 
320 325 330 

TTT AAC AAC TGC ACA GTG AAT CCA AAA GAA AGT ATA CTG AAA AGA GTG 10 56 

Phe Asn Asn Cys Thr Val Asn Pro Lys Glu Ser He Leu Lys Arg Val 
335 340 345 350 

AAG GAT ATA GGA TAC ATC TTT AAA GAG AAA TTT GCT AAA GCT GTG GGA 1104 

Lys Asp He Gly Tyr He Phe Lys Glu Lys Phe Ala Lys Ala Val Gly 
355 360 365 

CAG GGT TGT GTC GAA ATT GGA TCA CAG CGA TAC AAA CTT GGA GTT CGC 1152 
Gin Gly Cys Val Glu He Gly Ser Gin Arg Tyr Lys Leu Gly Val Arg 
370 375 380 

TTG TAT TAC CGA GTA ATG GAA TCC ATG CTT AAA TCA GAA GAA GAA CGA 12 00 

Leu Tyr Tyr Arg Val Met Glu Ser Met Leu Lys Ser Glu Glu Glu Arg 
385 390 395 
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TTA TCC ATT CAA AAT TTT AGC AAA CTT CTG AAT GAC AAC ATT TTT CAT 1248 
Leu Ser lie Gin Asn Phe Ser Lys Leu Leu Asn Asp Asn lie Phe His 
400 405 410 

ATG TCT TTA TTG GCG TGC GCT CTT GAG GTT GTA ATG GCC ACA TAT AGC 12 96 

Met Ser Leu Leu Ala Cys Ala Leu Glu Val Val Met Ala Thr Tyr Ser 
415 420 425 430 

AGA AGT ACA TCT CAG AAT CTT GAT TCT GGA ACA GAT TTG TCT TTC CCA 1344 
Arg Ser Thr Ser Gin Asn Leu Asp Ser Gly Thr Asp Leu Ser Phe Pro 
435 440 445 

TGG ATT CTG AAT GTG CTT AAT TTA AAA GCC TTT GAT TTT TAC AAA GTG 13 92 

Trp lie Leu Asn Val Leu Asn Leu Lys Ala Phe Asp Phe Tyr Lys Val 
450 455 460 

ATC GAA AGT TTT ATC AAA GCA GAA GGC AAC TTG ACA AGA GAA ATG ATA 144 0 

He Glu Ser Phe He Lys Ala Glu Gly Asn Leu Thr Arg Glu Met He 
465 470 475 

AAA CAT TTA GAA CGA TGT GAA CAT CGA ATC ATG GAA TCC CTT GCA TGG 1488 
Lys His Leu Glu Arg Cys Glu His Arg He Met Glu Ser Leu Ala Trp 
480 485 490 

CTC TCA GAT TCA CCT TTA TTT GAT CTT ATT AAA CAA TCA AAG GAC CGA 1536 
Leu Ser Asp Ser Pro Leu Phe Asp Leu He Lys Gin Ser Lys Asp Arg 
495 500 505 510 

GAA GGA CCA ACT GAT CAC CTT GAA TCT GCT TGT CCT CTT AAT CTT CCT 1584 
Glu Gly Pro Thr Asp His Leu Glu Ser Ala Cys Pro Leu Asn Leu Pro 
515 520 525 

CTC CAG AAT AAT CAC ACT GCA GCA GAT ATG TAT CTT TCT CCT GTA AGA 1632 
Leu Gin Asn Asn His Thr Ala Ala Asp Met Tyr Leu Ser Pro Val Arg 
530 535 540 

TCT CCA AAG AAA AAA GGT TCA ACT ACG CGT GTA AAT TCT ACT GCA AAT 1680 
Ser Pro Lys Lys Lys Gly Ser Thr Thr Arg Val Asn Ser Thr Ala Asn 
545 550 555 

GCA GAG ACA CAA GCA ACC TCA GCC TTC CAG ACC CAG AAG CCA TTG AAA 172 8 

Ala Glu Thr Gin Ala Thr Ser Ala Phe Gin Thr Gin Lys Pro Leu Lys 
560 565 570 

TCT ACC TCT CTT TCA CTG TTT TAT AAA AAA GTG TAT CGG CTA GCC TAT 1776 
Ser Thr Ser Leu Ser Leu Phe Tyr Lys Lys Val Tyr Arg Leu Ala Tyr 
575 580 585 590 

CTC CGG CTA AAT ACA CTT TGT GAA CGC CTT CTG TCT GAG CAC CCA GAA 1824 
Leu Arg Leu Asn Thr Leu Cys Glu Arg Leu Leu Ser Glu His Pro Glu 
595 600 605 
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TTA GAA CAT ATC ATC TGG ACC CTT TTC CAG CAC ACC CTG CAG AAT GAG 1872 
Leu Glu His lie lie Trp Thr Leu Phe Gin His Thr Leu Gin Asn Glu 
610 615 620 

TAT GAA CTC ATG AGA GAC AGG CAT TTG GAC CAA ATT ATG ATG TGT TCC 192 0 

Tyr Glu Leu Met Arg Asp Arg His Leu Asp Gin lie Met Met Cys Ser 
625 630 635 

ATG TAT GGC ATA TGC AAA GTG AAG AAT ATA GAC CTT AAA TTC AAA ATC 1968 
Met Tyr Gly lie Cys Lys Val Lys Asn lie Asp Leu Lys Phe Lys lie 
640 645 650 

ATT GTA AC A GCA TAC AAG GAT CTT CCT CAT GCT GTT CAG GAG ACA TTC 2 016 

lie Val Thr Ala Tyr Lys Asp Leu Pro His Ala Val Gin Glu Thr Phe 
655 660 665 670 

AAA CGT GTT TTG ATC AAA GAA GAG GAG TAT GAT TCT ATT ATA GTA TTC 2064 
Lys Arg Val Leu lie Lys Glu Glu Glu Tyr Asp Ser lie lie Val Phe 
675 680 685 

TAT AAC TCG GTC TTC ATG CAG AGA CTG AAA ACA AAT ATT TTG CAG TAT 2112 
Tyr Asn Ser Val Phe Met Gin Arg Leu Lys Thr Asn lie Leu Gin Tyr 
690 695 700 

GCT TCC ACC AGG CCC CCT ACC TTG TCA CCA ATA CCT CAC ATT CCT CGA 216 0 

Ala Ser Thr Arg Pro Pro Thr Leu Ser Pro lie Pro His He Pro Arg 
705 710 715 

AGC CCT TAC AAG TTT CCT AGT TCA CCC TTA CGG ATT CCT GGA GGG AAC 22 08 

Ser Pro Tyr Lys Phe Pro Ser Ser Pro Leu Arg He Pro Gly Gly Asn 
720 725 730 

ATC TAT ATT TCA CCC CTG AAG AGT CCA TAT AAA ATT TCA GAA GGT CTG 2256 
He Tyr He Ser Pro Leu Lys Ser Pro Tyr Lys He Ser Glu Gly Leu 
735 740 745 750 

CCA ACA CCA ACA AAA ATG ACT CCA AGA TCA AGA ATC TTA GTA TCA ATT 2304 
Pro Thr Pro Thr Lys Met Thr Pro Arg Ser Arg He Leu Val Ser He 
755 760 765 

GGT GAA TCA TTC GGG ACT TCT GAG AAG TTC CAG AAA ATA AAT CAG ATG 23 52 

Gly Glu Ser Phe Gly Thr Ser Glu Lys Phe Gin Lys He Asn Gin Met 
770 775 780 

GTA TGT AAC AGC GAC CGT GTG CTC AAA AGA AGT GCT GAA GGA AGC AAC 24 00 

Val Cys Asn Ser Asp Arg Val Leu Lys Arg Ser Ala Glu Gly Ser Asn 
785 790 795 

CCT CCT AAA CCA CTG AAA AAA CTA CGC TTT GAT ATT GAA GGA TCA GAT 2448 
Pro Pro Lys Pro Leu Lys Lys Leu Arg Phe Asp He Glu Gly Ser Asp 
800 805 810 
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GAA GCA GAT GGA AGT AAA CAT CTC CCA GGA GAG TCC AAA TTT CAG CAG 2496 
Glu Ala Asp Gly Ser Lys His Leu Pro Gly Glu Ser Lys Phe Gin Gin 
815 820 825 830 

AAA CTG GCA GAA ATG ACT TCT ACT CGA ACA CGA ATG CAA AAG CAG AAA 2 544 

Lys Leu Ala Glu Met Thr Ser Thr Arg Thr Arg Met Gin Lys Gin Lys 
835 840 845 

ATG AAT GAT AGC ATG GAT ACC TCA AAC AAG GAA GAG AAA TGAGGATCTC 25 93 

Met Asn Asp Ser Met Asp Thr Ser Asn Lys Glu Glu Lys 
850 855 

AGGACCTTGG TGGACACTGT GTACACCTCT GGATTCATTG TCTCTCACAG ATGTGACTGT 2653 

ATAACTTTCC CAGGTTCTGT TTATGGCCAC ATTTAATATC TTCAGCTCTT TTTGTGGATA 2713 

TAAAATGTGC AGATGCAATT GTTTGGGTGA TTCCTAAGCC ACTTGAAATG TTAGTCATTG 2 773 

TTATTTATAC AAGATTGAAA ATCTTGTGTA AATCCTGCCA TTTAAAAAGT TGTAGCAGAT 2833 

TGTTTCCTCT TCCAAAGTAA AATTGCTGTG CTTTATGGAT AGTAAGAATG GCCCTAGAGT 2893 

GGGAGTCCTG ATAACCCAGG CCTGTCTGAC TACTTTGCCT TCTTTTGTAG CATATAGGTG 2953 

ATGTTTGCTC TTGTTTTTAT TAATTTATAT GTATATTTTT TTAATTTAAC ATGAACACCC 3013 

TTAGAAAATG TGTCCTATCT ATCTTCCAAA TGCAATTTGA TTGACTGCCC ATTCACCAAA 3 0 73 

ATTATCCTGA ACTCTTCTGC AAAAATGGAT ATTATTAGAA ATTAGAAAAA AATTACTAAT 3133 

TTTACACATT AGATTTTATT TTACTATTGG AATCTGATAT ACTGTGTGCT TGTTTTATAA 3193 

AATTTTGCTT TTAATTAAAT AAAAGCTGGA AGCAAAGTAT AAC CATATGA TACTATCATA 32 53 

CTACTGAAAC AGATTTCATA CCTCAGAATG TAAAAGAACT TACTGATTAT TTTCTTCATC 3313 

CAACTTATGT TTTTAAATGA GGATTATTGA TAGT 3347 

(2) INFORMATION FOR SEQ ID NO:43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 859 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43: 

Met Pro Pro Lys Thr Pro Arg Lys Thr Ala Ala Thr Ala Ala Ala Ala 
15 10 15 
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Ala Ala Glu Pro Pro Ala Pro Pro Pro Pro Pro Pro Pro Glu Glu Asp 
20 25 30 



Pro Glu Gin Asp Ser Gly Pro Glu Asp Leu Pro Leu Val Arg Leu Glu 
35 40 45 

Phe Glu Glu Thr Glu Glu Pro Asp Phe Thr Ala Leu Cys Gin Lys Leu 
50 55 60 

Lys lie Pro Asp His Val Arg Glu Arg Ala Trp Leu Thr Trp Glu Lys 
65 70 75 80 

Val Ser Ser Val Asp Gly Val Leu Gly Gly Tyr lie Gin Lys Lys Lys 
85 90 95 

Glu Leu Trp Gly lie Cys lie Phe lie Ala Ala Val Asp Leu Val Glu 
100 105 110 

Ser Thr Glu lie Asn Ser Ala Leu Val Leu Lys Val Ser Trp lie Thr 
115 120 125 

Phe Leu Leu Ala Lys Gly Glu Val Leu Gin Met Glu Asp Asp Leu Val 
130 135 140 

lie Ser Phe Gin Leu Met Leu Cys Val Leu Asp Tyr Phe lie Lys Leu 
145 150 155 160 

Ser Pro Pro Met Leu Leu Lys Glu Pro Tyr Lys Thr Ala Val lie Pro 
165 170 175 

lie Asn Gly Ser Pro Arg Thr Pro Arg Arg Gly Gin Asn Arg Ser Ala 
180 185 190 

Arg lie Ala Lys Gin Leu Glu Asn Asp Thr Arg lie He Glu Val Leu 
195 200 205 

Cys Lys Glu His Glu Cys Asn He Asp Glu Val Lys Asn Val Tyr Phe 
210 215 220 

Lys Asn Phe He Pro Phe Met Asn Ser Leu Gly Leu Val Thr Ser Asn 
225 230 235 240 

Gly Leu Pro Glu Val Glu Asn Leu Ser Lys Arg Tyr Glu Glu He Tyr 
245 250 255 

Leu Lys Asn Lys Asp Leu Asp Ala Arg Leu Phe Leu Asp His Asp Lys 
260 265 270 

Thr Leu Gin Thr Asp Ser He Asp Ser Phe Glu Thr Gin Arg Thr Pro 
275 280 285 

Arg Lys Ser Asn Leu Asp Glu Glu Val Asn Val He Pro Pro His Thr 
290 295 300 
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Pro Val Arg Thr Val Met Asn Thr lie Gin Gin Leu Met Met lie Leu 
305 310 315 320 



Asn Ser Ala Ser Asp Gin Pro Ser Glu Asn Leu He Ser Tyr Phe Asn 
325 330 335 

Asn Cys Thr Val Asn Pro Lys Glu Ser He Leu Lys Arg Val Lys Asp 
340 345 350 

He Gly Tyr He Phe Lys Glu Lys Phe Ala Lys Ala Val Gly Gin Gly 
355 360 365 

Cys Val Glu He Gly Ser Gin Arg Tyr Lys Leu Gly Val Arg Leu Tyr 
370 375 380 

Tyr Arg Val Met Glu Ser Met Leu Lys Ser Glu Glu Glu Arg Leu Ser 
385 390 395 400 

He Gin Asn Phe Ser Lys Leu Leu Asn Asp Asn He Phe His Met Ser 
405 410 415 

Leu Leu Ala Cys Ala Leu Glu Val Val Met Ala Thr Tyr Ser Arg Ser 
420 425 430 

Thr Ser Gin Asn Leu Asp Ser Gly Thr Asp Leu Ser Phe Pro Trp He 
435 440 445 

Leu Asn Val Leu Asn Leu Lys Ala Phe Asp Phe Tyr Lys Val He Glu 
450 455 460 

Ser Phe He Lys Ala Glu Gly Asn Leu Thr Arg Glu Met He Lys His 
465 470 475 480 

Leu Glu Arg Cys Glu His Arg He Met Glu Ser Leu Ala Trp Leu Ser 
485 490 495 

Asp Ser Pro Leu Phe Asp Leu He Lys Gin Ser Lys Asp Arg Glu Gly 
500 505 510 

Pro Thr Asp His Leu Glu Ser Ala Cys Pro Leu Asn Leu Pro Leu Gin 
515 520 525 

Asn Asn His Thr Ala Ala Asp Met Tyr Leu Ser Pro Val Arg Ser Pro 
530 535 540 

Lys Lys Lys Gly Ser Thr Thr Arg Val Asn Ser Thr Ala Asn Ala Glu 
545 550 555 560 

Thr Gin Ala Thr Ser Ala Phe Gin Thr Gin Lys Pro Leu Lys Ser Thr 
565 570 575 

Ser Leu Ser Leu Phe Tyr Lys Lys Val Tyr Arg Leu Ala Tyr Leu Arg 
580 585 590 
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Leu Asn Thr Leu Cys Glu Arg Leu Leu Ser Glu His Pro Glu Leu Glu 
595 600 605 



His lie lie Trp Thr Leu Phe Gin His Thr Leu Gin Asn Glu Tyr Glu 
610 615 620 

Leu Met Arg Asp Arg His Leu Asp Gin lie Met Met Cys Ser Met Tyr 
625 630 635 640 

Gly lie Cys Lys Val Lys Asn lie Asp Leu Lys Phe Lys lie lie Val 
645 650 655 

Thr Ala Tyr Lys Asp Leu Pro His Ala Val Gin Glu Thr Phe Lys Arg 
660 665 670 

Val Leu lie Lys Glu Glu Glu Tyr Asp Ser lie lie Val Phe Tyr Asn 
675 680 685 

Ser Val Phe Met Gin Arg Leu Lys Thr Asn lie Leu Gin Tyr Ala Ser 
690 695 700 

Thr Arg Pro Pro Thr Leu Ser Pro lie Pro His lie Pro Arg Ser Pro 
705 710 715 720 

Tyr Lys Phe Pro Ser Ser Pro Leu Arg lie Pro Gly Gly Asn lie Tyr 
725 730 735 

lie Ser Pro Leu Lys Ser Pro Tyr Lys lie Ser Glu Gly Leu Pro Thr 
740 745 750 

Pro Thr Lys Met Thr Pro Arg Ser Arg lie Leu Val Ser lie Gly Glu 
755 760 765 

Ser Phe Gly Thr Ser Glu Lys Phe Gin Lys lie Asn Gin Met Val Cys 
770 775 780 

Asn Ser Asp Arg Val Leu Lys Arg Ser Ala Glu Gly Ser Asn Pro Pro 
785 790 795 800 

Lys Pro Leu Lys Lys Leu Arg Phe Asp lie Glu Gly Ser Asp Glu Ala 

805 810 815 

Asp Gly Ser Lys His Leu Pro Gly Glu Ser Lys Phe Gin Gin Lys Leu 
820 825 830 

Ala Glu Met Thr Ser Thr Arg Thr Arg Met Gin Lys Gin Lys Met Asn 
835 840 845 

Asp Ser Met Asp Thr Ser Asn Lys Glu Glu Lys 
850 855 



(2) INFORMATION FOR SEQ ID NO:44: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3161 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 7. .23 97 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:44: 

CGCGTC ATG CCG CCC AAA ACC CCC CGA AAA ACG GCC GCC ACC GCC GCC 48 
Met Pro Pro Lys Thr Pro Arg Lys Thr Ala Ala Thr Ala Ala 
15 10 

GCT GCC GCC GCG GAA CCC CCG GCA CCG CCG CCG CCG CCC CCT CCT GAG 96 
Ala Ala Ala Ala Glu Pro Pro Ala Pro Pro Pro Pro Pro Pro Pro Glu 
15 20 25 30 

GAG GAC CCA GAG CAG GAC AGC GGC CCG GAG GAC CTG CCT CTC GTC AGG 144 
Glu Asp Pro Glu Gin Asp Ser Gly Pro Glu Asp Leu Pro Leu Val Arg 
35 40 45 

CTT GAG TTT GAA GAA ACA GAA GAA CCT GAT TTT ACT GCA TTA TGT CAG 192 
Leu Glu Phe Glu Glu Thr Glu Glu Pro Asp Phe Thr Ala Leu Cys Gin 
50 55 60 

AAA TTA AAG ATA CCA GAT CAT GTC AGA GAG AGA GCT TGG TTA ACT TGG 24 0 

Lys Leu Lys lie Pro Asp His Val Arg Glu Arg Ala Trp Leu Thr Trp 
65 70 75 

GAG AAA GTT TCA TCT GTG GAT GGA GTA TTG GGA GGT TAT ATT CAA AAG 2 88 

Glu Lys Val Ser Ser Val Asp Gly Val Leu Gly Gly Tyr He Gin Lys 
80 85 90 

AAA AAG GAA CTG TGG GGA ATC TGT ATC TTT ATT GCA GCA GTT GAC CTA 336 
Lys Lys Glu Leu Trp Gly He Cys He Phe He Ala Ala Val Asp Leu 
95 100 105 110 

GCT GTT ATA CCC ATT AAT GGT TCA CCT CGA ACA CCC AGG CGA GGT CAG 3 84 

Ala Val He Pro He Asn Gly Ser Pro Arg Thr Pro Arg Arg Gly Gin 
H5 12 o 125 

AAC AGG AGT GCA CGG ATA GCA AAA CAA CTA GAA AAT GAT ACA AGA ATT 432 
Asn Arg Ser Ala Arg He Ala Lys Gin Leu Glu Asn Asp Thr Arg He 
130 135 140 

ATT GAA GTT CTC TGT AAA GAA CAT GAA TGT AAT ATA GAT GAG GTG AAA 480 
He Glu Val Leu Cys Lys Glu His Glu Cys Asn He Asp Glu Val Lys 
145 150 155 
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AAT GTT TAT TTC AAA AAT TTT ATA CCT TTT ATG AAT TCT CTT GGA CTT 52 8 

Asn Val Tyr Phe Lys Asn Phe He Pro Phe Met Asn Ser Leu Gly Leu 
160 165 170 

GTA ACA TCT AAT GGA CTT CCA GAG GTT GAA AAT CTT TCT AAA CGA TAC 576 
Val Thr Ser Asn Gly Leu Pro Glu Val Glu Asn Leu Ser Lys Arg Tyr 
175 180 185 190 

GAA GAA ATT TAT CTT AAA AAT AAA GAT CTA GAT GCA AGA TTA TTT TTG 624 
Glu Glu He Tyr Leu Lys Asn Lys Asp Leu Asp Ala Arg Leu Phe Leu 
195 200 205 

GAT CAT GAT AAA ACT CTT CAG ACT GAT TCT ATA GAC AGT TTT GAA ACA 6 72 

Asp His Asp Lys Thr Leu Gin Thr Asp Ser He Asp Ser Phe Glu Thr 
210 215 220 

CAG AGA ACA CCA CGA AAA AGT AAC CTT GAT GAA GAG GTG AAT GTA ATT 720 
Gin Arg Thr Pro Arg Lys Ser Asn Leu Asp Glu Glu Val Asn Val He 
225 230 235 

CCT CCA CAC ACT CCA GTT AGG ACT GTT ATG AAC ACT ATC CAA CAA TTA 768 
Pro Pro His Thr Pro Val Arg Thr Val Met Asn Thr He Gin Gin Leu 
240 245 250 

ATG ATG ATT TTA AAT TCA GCA AGT GAT CAA CCT TCA GAA AAT CTG ATT 816 
Met Met He Leu Asn Ser Ala Ser Asp Gin Pro Ser Glu Asn Leu He 
255 260 265 270 

TCC TAT TTT AAC AAC TGC ACA GTG AAT CCA AAA GAA AGT ATA CTG AAA 864 
Ser Tyr Phe Asn Asn Cys Thr Val Asn Pro Lys Glu Ser He Leu Lys 
275 280 285 

AGA GTG AAG GAT ATA GGA TAC ATC TTT AAA GAG AAA TTT GCT AAA GCT 912 
Arg Val Lys Asp He Gly Tyr He Phe Lys Glu Lys Phe Ala Lys Ala 
290 295 300 

GTG GGA CAG GGT TGT GTC GAA ATT GGA TCA CAG CGA TAC AAA CTT GGA 960 
Val Gly Gin Gly Cys Val Glu He Gly Ser Gin Arg Tyr Lys Leu Gly 
305 310 315 

GTT CGC TTG TAT TAC CGA GTA ATG GAA TCC ATG CTT AAA TCA GAA GAA 1008 
Val Arg Leu Tyr Tyr Arg Val Met Glu Ser Met Leu Lys Ser Glu Glu 
320 325 330 

GAA CGA TTA TCC ATT CAA AAT TTT AGC AAA CTT CTG AAT GAC AAC ATT 1056 
Glu Arg Leu Ser He Gin Asn Phe Ser Lys Leu Leu Asn Asp Asn He 
335 340 345 350 

TTT CAT ATG TCT TTA TTG GCG TGC GCT CTT GAG GTT GTA ATG GCC ACA 1104 
Phe His Met Ser Leu Leu Ala Cys Ala Leu Glu Val Val Met Ala Thr 
355 360 365 
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TAT AGC AGA AGT ACA TCT CAG AAT CTT GAT TCT GGA ACA GAT TTG TCT 1152 
Tyr Ser Arg Ser Thr Ser Gin Asn Leu Asp Ser Gly Thr Asp Leu Ser 
370 375 380 

TTC CCA TGG ATT CTG AAT GTG CTT AAT TTA AAA GCC TTT GAT TTT TAC 12 0 0 

Phe Pro Trp lie Leu Asn Val Leu Asn Leu Lys Ala Phe Asp Phe Tyr 
385 390 395 

AAA GTG ATC GAA AGT TTT ATC AAA GCA GAA GGC AAC TTG ACA AGA GAA 1248 
Lys Val lie Glu Ser Phe lie Lys Ala Glu Gly Asn Leu Thr Arg Glu 
400 405 410 

ATG ATA AAA CAT TTA GAA CGA TGT GAA CAT CGA ATC ATG GAA TCC CTT 1296 
Met lie Lys His Leu Glu Arg Cys Glu His Arg lie Met Glu Ser Leu 
415 420 425 430 

GCA TGG CTC TCA GAT TCA CCT TTA TTT GAT CTT ATT AAA CAA TCA AAG 1344 
Ala Trp Leu Ser Asp Ser Pro Leu Phe Asp Leu lie Lys Gin Ser Lys 
435 440 445 

GAC CGA GAA GGA CCA ACT GAT CAC CTT GAA TCT GCT TGT CCT CTT AAT 13 92 

Asp Arg Glu Gly Pro Thr Asp His Leu Glu Ser Ala Cys Pro Leu Asn 
450 455 460 

CTT CCT CTC CAG AAT AAT CAC ACT GCA GCA GAT ATG TAT CTT TCT CCT 144 0 

Leu Pro Leu Gin Asn Asn His Thr Ala Ala Asp Met Tyr Leu Ser Pro 
465 470 475 

GTA AGA TCT CCA AAG AAA AAA GGT TCA ACT ACG CGT GTA AAT TCT ACT 14 8 8 

Val Arg Ser Pro Lys Lys Lys Gly Ser Thr Thr Arg Val Asn Ser Thr 
480 485 490 

GCA AAT GCA GAG ACA CAA GCA ACC TCA GCC TTC CAG ACC CAG AAG CCA 153 6 

Ala Asn Ala Glu Thr Gin Ala Thr Ser Ala Phe Gin Thr Gin Lys Pro 
495 500 505 510 

TTG AAA TCT ACC TCT CTT TCA CTG TTT TAT AAA AAA GTG TAT CGG CTA 1584 
Leu Lys Ser Thr Ser Leu Ser Leu Phe Tyr Lys Lys Val Tyr Arg Leu 
515 520 525 

GCC TAT CTC CGG CTA AAT ACA CTT TGT GAA CGC CTT CTG TCT GAG CAC 1632 
Ala Tyr Leu Arg Leu Asn Thr Leu Cys Glu Arg Leu Leu Ser Glu His 
530 535 540 

CCA GAA TTA GAA CAT ATC ATC TGG ACC CTT TTC CAG CAC ACC CTG CAG 16 8 0 

Pro Glu Leu Glu His lie lie Trp Thr Leu Phe Gin His Thr Leu Gin 
545 550 555 

AAT GAG TAT GAA CTC ATG AGA GAC AGG CAT TTG GAC CAA ATT ATG ATG 1728 
Asn Glu Tyr Glu Leu Met Arg Asp Arg His Leu Asp Gin lie Met Met 
560 565 570 
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TGT TCC ATG TAT GGC ATA TGC AAA GTG AAG AAT ATA GAC CTT AAA TTC 1776 
Cys Ser Met Tyr Gly lie Cys Lys Val Lys Asn lie Asp Leu Lys Phe 
575 580 585 590 

AAA ATC ATT GTA ACA GCA TAC AAG GAT CTT CCT CAT GCT GTT CAG GAG 1824 
Lys lie lie Val Thr Ala Tyr Lys Asp Leu Pro His Ala Val Gin Glu 
595 600 605 

ACA TTC AAA CGT GTT TTG ATC AAA GAA GAG GAG TAT GAT TCT ATT ATA 1872 
Thr Phe Lys Arg Val Leu lie Lys Glu Glu Glu Tyr Asp Ser lie lie 
610 615 620 

GTA TTC TAT AAC TCG GTC TTC ATG CAG AGA CTG AAA ACA AAT ATT TTG 192 0 

Val Phe Tyr Asn Ser Val Phe Met Gin Arg Leu Lys Thr Asn lie Leu 
625 630 635 

CAG TAT GCT TCC ACC AGG CCC CCT ACC TTG TCA CCA ATA CCT CAC ATT 1968 
Gin Tyr Ala Ser Thr Arg Pro Pro Thr Leu Ser Pro lie Pro His lie 
640 645 650 

CCT CGA AGC CCT TAC AAG TTT CCT AGT TCA CCC TTA CGG ATT CCT GGA 2 016 

Pro Arg Ser Pro Tyr Lys Phe Pro Ser Ser Pro Leu Arg lie Pro Gly 
655 660 665 670 

GGG AAC ATC TAT ATT TCA CCC CTG AAG AGT CCA TAT AAA ATT TCA GAA 2064 

Gly Asn lie Tyr lie Ser Pro Leu Lys Ser Pro Tyr Lys lie Ser Glu 
675 680 685 

GGT CTG CCA ACA CCA ACA AAA ATG ACT CCA AGA TCA AGA ATC TTA GTA 2112 
Gly Leu Pro Thr Pro Thr Lys Met Thr Pro Arg Ser Arg lie Leu Val 
690 695 700 

TCA ATT GGT GAA TCA TTC GGG ACT TCT GAG AAG TTC CAG AAA ATA AAT 2160 
Ser lie Gly Glu Ser Phe Gly Thr Ser Glu Lys Phe Gin Lys lie Asn 
705 710 715 

CAG ATG GTA TGT AAC AGC GAC CGT GTG CTC AAA AGA AGT GCT GAA GGA 22 08 

Gin Met Val Cys Asn Ser Asp Arg Val Leu Lys Arg Ser Ala Glu Gly 
720 725 730 

AGC AAC CCT CCT AAA CCA CTG AAA AAA CTA CGC TTT GAT ATT GAA GGA 2256 
Ser Asn Pro Pro Lys Pro Leu Lys Lys Leu Arg Phe Asp lie Glu Gly 
735 740 745 750 

TCA GAT GAA GCA GAT GGA AGT AAA CAT CTC CCA GGA GAG TCC AAA TTT 23 04 

Ser Asp Glu Ala Asp Gly Ser Lys His Leu Pro Gly Glu Ser Lys Phe 
755 760 765 

CAG CAG AAA CTG GCA GAA ATG ACT TCT ACT CGA ACA CGA ATG CAA AAG 2352 
Gin Gin Lys Leu Ala Glu Met Thr Ser Thr Arg Thr Arg Met Gin Lys 
770 775 780 
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CAG AAA ATG AAT GAT AGC ATG GAT ACC TCA AAC AAG GAA GAG AAA 23 97 

Gin Lys Met Asn Asp Ser Met Asp Thr Ser Asn Lys Glu Glu Lys 
785 790 795 

TGAGGATCTC AGGACCTTGG TGGACACTGT GTACACCTCT GGATTCATTG TCTCTCACAG 2457 

ATGTGACTGT ATAACTTTCC CAGGTTCTGT TTATGGCCAC ATTTAATATC TTCAGCTCTT 2 517 

TTTGTGGATA TAAAATGTGC AGATGCAATT GTTTGGGTGA TTCCTAAGCC AGTTGAAATG 2 577 

TTAGTCATTG TTATTTATAC AAGATTGAAA ATCTTGTGTA AATCCTGCCA TTTAAAAAGT 263 7 

TGTAGCAGAT TGTTTCCTCT TCCAAAGTAA AATTGCTGTG CTTTATGGAT AGTAAGAATG 269 7 

GCCCTAGAGT GGGAGTCCTG ATAACCCAGG CCTGTCTGAC TACTTTGCCT TCTTTTGTAG 2 757 

CATATAGGTG ATGTTTGCTC TTGTTTTTAT TAATTTATAT GTATATTTTT TTAATTTAAC 2817 

ATGAACACCC TTAGAAAATG TGTCCTATCT ATCTTCCAAA TGCAATTTGA TTGACTGCCC 2877 

ATTCACCAAA ATTATCCTGA ACTCTTCTGC AAAAATGGAT ATTATTAGAA ATTAGAAAAA 2 93 7 

AATTACTAAT TTTACACATT AGATTTTATT TTACTATTGG AAT CTGATAT ACTGTGTGCT 2 997 

TGTTTTATAA AATTTTGCTT TTAATTAAAT AAAAGCTGGA AGCAAAGTAT AACCATATGA 3 057 

TACTATCATA CTAC TGAAAC AGATTTCATA CCTCAGAATG TAAAAGAACT TACTGATTAT 3117 

TTTCTTCATC CAACTTATGT TTTTAAATGA GGATTATTGA TAGT 3161 

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 797 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:45: 

Met Pro Pro Lys Thr Pro Arg Lys Thr Ala Ala Thr Ala Ala Ala Ala 
1 5 10 15 

Ala Ala Glu Pro Pro Ala Pro Pro Pro Pro Pro Pro Pro Glu Glu Asp 
20 25 30 

Pro Glu Gin Asp Ser Gly Pro Glu Asp Leu Pro Leu Val Arg Leu Glu 
35 40 45 

Phe Glu Glu Thr Glu Glu Pro Asp Phe Thr Ala Leu Cys Gin Lys Leu 
50 55 60 
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Lys lie Pro Asp His Val Arg Glu Arg Ala Trp Leu Thr Trp Glu Lys 
65 70 75 80 



Val Ser Ser Val Asp Gly Val Leu Gly Gly Tyr lie Gin Lys Lys Lys 
85 90 95 

Glu Leu Trp Gly lie Cys lie Phe lie Ala Ala Val Asp Leu Ala Val 
100 105 110 

lie Pro lie Asn Gly Ser Pro Arg Thr Pro Arg Arg Gly Gin Asn Arg 
115 120 125 

Ser Ala Arg lie Ala Lys Gin Leu Glu Asn Asp Thr Arg lie lie Glu 
130 135 140 

Val Leu Cys Lys Glu His Glu Cys Asn lie Asp Glu Val Lys Asn Val 
145 150 155 160 

Tyr Phe Lys Asn Phe lie Pro Phe Met Asn Ser Leu Gly Leu Val Thr 
165 170 175 

Ser Asn Gly Leu Pro Glu Val Glu Asn Leu Ser Lys Arg Tyr Glu Glu 
180 185 190 

lie Tyr Leu Lys Asn Lys Asp Leu Asp Ala Arg Leu Phe Leu Asp His 
195 200 205 

Asp Lys Thr Leu Gin Thr Asp Ser lie Asp Ser Phe Glu Thr Gin Arg 
210 215 220 

Thr Pro Arg Lys Ser Asn Leu Asp Glu Glu Val Asn Val lie Pro Pro 
225 230 235 240 

His Thr Pro Val Arg Thr Val Met Asn Thr lie Gin Gin Leu Met Met 
245 250 255 

lie Leu Asn Ser Ala Ser Asp Gin Pro Ser Glu Asn Leu lie Ser Tyr 
260 265 270 

Phe Asn Asn Cys Thr Val Asn Pro Lys Glu Ser lie Leu Lys Arg Val 
275 280 285 

Lys Asp lie Gly Tyr lie Phe Lys Glu Lys Phe Ala Lys Ala Val Gly 
290 295 300 

Gin Gly Cys Val Glu He Gly Ser Gin Arg Tyr Lys Leu Gly Val Arg 
305 310 315 320 

Leu Tyr Tyr Arg Val Met Glu Ser Met Leu Lys Ser Glu Glu Glu Arg 
325 330 335 

Leu Ser He Gin Asn Phe Ser Lys Leu Leu Asn Asp Asn He Phe His 
340 345 350 
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Met Ser Leu Leu Ala Cys Ala Leu Glu Val Val Met Ala Thr Tyr Ser 
355 360 365 



Arg Ser Thr Ser Gin Asn Leu Asp Ser Gly Thr Asp Leu Ser Phe Pro 
370 375 380 

Trp He Leu Asn Val Leu Asn Leu Lys Ala Phe Asp Phe Tyr Lys Val 
385 390 395 400 

He Glu Ser Phe He Lys Ala Glu Gly Asn Leu Thr Arg Glu Met He 
405 410 415 

Lys His Leu Glu Arg Cys Glu His Arg He Met Glu Ser Leu Ala Trp 
420 425 430 

Leu Ser Asp Ser Pro Leu Phe Asp Leu He Lys Gin Ser Lys Asp Arg 
435 440 445 

Glu Gly Pro Thr Asp His Leu Glu Ser Ala Cys Pro Leu Asn Leu Pro 
450 455 460 

Leu Gin Asn Asn His Thr Ala Ala Asp Met Tyr Leu Ser Pro Val Arg 
465 470 475 480 

Ser Pro Lys Lys Lys Gly Ser Thr Thr Arg Val Asn Ser Thr Ala Asn 
485 490 495 

Ala Glu Thr Gin Ala Thr Ser Ala Phe Gin Thr Gin Lys Pro Leu Lys 
500 505 510 

Ser Thr Ser Leu Ser Leu Phe Tyr Lys Lys Val Tyr Arg Leu Ala Tyr 
515 520 525 

Leu Arg Leu Asn Thr Leu Cys Glu Arg Leu Leu Ser Glu His Pro Glu 
530 535 540 

Leu Glu His He He Trp Thr Leu Phe Gin His Thr Leu Gin Asn Glu 
545 550 555 560 

Tyr Glu Leu Met Arg Asp Arg His Leu Asp Gin He Met Met Cys Ser 
565 570 575 

Met Tyr Gly He Cys Lys Val Lys Asn He Asp Leu Lys Phe Lys He 
580 585 590 

He Val Thr Ala Tyr Lys Asp Leu Pro His Ala Val Gin Glu Thr Phe 
595 600 605 

Lys Arg Val Leu He Lys Glu Glu Glu Tyr Asp Ser He He Val Phe 
610 615 620 

Tyr Asn Ser Val Phe Met Gin Arg Leu Lys Thr Asn He Leu Gin Tyr 
625 630 635 640 



-221- 



Ala Ser Thr Arg 



Ser Pro Tyr Lys 
660 

lie Tyr lie Ser 
675 

Pro Thr Pro Thr 
690 

Gly Glu Ser Phe 
705 

Val Cys Asn Ser 



Pro Pro Lys Pro 
740 

Glu Ala Asp Gly 
755 

Lys Leu Ala Glu 
770 

Met Asn Asp Ser 
785 



Pro Pro Thr Leu 
645 

Phe Pro Ser Ser 



Pro Leu Lys Ser 
680 

Lys Met Thr Pro 
695 

Gly Thr Ser Glu 
710 

Asp Arg Val Leu 
725 

Leu Lys Lys Leu 



Ser Lys His Leu 
760 

Met Thr Ser Thr 
775 

Met Asp Thr Ser 
790 



Ser Pro lie Pro 
650 

Pro Leu Arg lie 
665 

Pro Tyr Lys lie 



Arg Ser Arg lie 
700 

Lys Phe Gin Lys 
715 

Lys Arg Ser Ala 
730 

Arg Phe Asp lie 
745 

Pro Gly Glu Ser 



Arg Thr Arg Met 
780 

Asn Lys Glu Glu 
795 



His lie Pro Arg 
655 

Pro Gly Gly Asn 
670 

Ser Glu Gly Leu 
685 

Leu Val Ser lie 



lie Asn Gin Met 
720 

Glu Gly Ser Asn 
735 

Glu Gly Ser Asp 
750 

Lys Phe Gin Gin 
765 

Gin Lys Gin Lys 



Lys 



(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 77 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 7.. 2613 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:46: 

CGCGTC ATG CCG CCC AAA ACC CCC CGA AAA ACG GCC GCC ACC GCC GCC 48 
Met Pro Pro Lys Thr Pro Arg Lys Thr Ala Ala Thr Ala Ala 
1 5 10 

GCT GCC GCC GCG GAA CCC CCG GCA CCG CCG CCG CCG CCC CCT CCT GAG 96 
Ala Ala Ala Ala Glu Pro Pro Ala Pro Pro Pro Pro Pro Pro Pro Glu 
15 20 25 30 
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GAG GAC CCA GAG CAG GAC AGC GGC CCG GAG GAC CTG CCT CTC GTC AGG 144 
Glu Asp Pro Glu Gin Asp Ser Gly Pro Glu Asp Leu Pro Leu Val Arg 
35 40 45 

CTT GAG TTT GAA GAA ACA GAA GAA CCT GAT TTT ACT GCA TTA TGT CAG 192 
Leu Glu Phe Glu Glu Thr Glu Glu Pro Asp Phe Thr Ala Leu Cys Gin 
50 55 60 

AAA TTA AAG ATA CCA GAT CAT GTC AGA GAG AGA GCT TGG TTA ACT TGG 240 
Lys Leu Lys lie Pro Asp His Val Arg Glu Arg Ala Trp Leu Thr Trp 
65 70 75 

GAG AAA GTT TCA TCT GTG GAT GGA GTA TTG GGA GGT TAT ATT CAA AAG 2 88 

Glu Lys Val Ser Ser Val Asp Gly Val Leu Gly Gly Tyr lie Gin Lys 
80 85 90 

AAA AAG GAA CTG TGG GGA ATC TGT ATC TTT ATT GCA GCA GTT GAC CTA 336 
Lys Lys Glu Leu Trp Gly lie Cys lie Phe lie Ala Ala Val Asp Leu 
95 100 105 110 

GAT GAG ATG TCG TTC ACT TTT ACT GAG CTA CAG AAA AAC ATA GAA ATC 384 
Asp Glu Met Ser Phe Thr Phe Thr Glu Leu Gin Lys Asn lie Glu lie 
115 120 125 

AGT GTC CAT AAA TTC TTT AAC TTA CTA AAA GAA ATT GAT ACC AGT ACC 432 
Ser Val His Lys Phe Phe Asn Leu Leu Lys Glu lie Asp Thr Ser Thr 
130 135 140 

AAA GTT GAT AAT GCT ATG TCA AGA CTG TTG AAG AAG TAT GAT GTA TTG 480 
Lys Val Asp Asn Ala Met Ser Arg Leu Leu Lys Lys Tyr Asp Val Leu 
145 150 155 

TTT GCA CTC TTC AGC AAA TTG GAA AGG ACA TGT GAA CTT ATA TAT TTG 52 8 

Phe Ala Leu Phe Ser Lys Leu Glu Arg Thr Cys Glu Leu lie Tyr Leu 
160 165 170 

ACA CAA CCC AGC AGT TCG ATG GTC GCT GTT ATA CCC ATT AAT GGT TCA 576 
Thr Gin Pro Ser Ser Ser Met Val Ala Val lie Pro lie Asn Gly Ser 
175 180 185 190 

CCT CGA ACA CCC AGG CGA GGT CAG AAC AGG AGT GCA CGG ATA GCA AAA 624 
Pro Arg Thr Pro Arg Arg Gly Gin Asn Arg Ser Ala Arg lie Ala Lys 
195 200 205 

CAA CTA GAA AAT GAT ACA AGA ATT ATT GAA GTT CTC TGT AAA GAA CAT 672 
Gin Leu Glu Asn Asp Thr Arg lie lie Glu Val Leu Cys Lys Glu His 
210 215 220 

GAA TGT AAT ATA GAT GAG GTG AAA AAT GTT TAT TTC AAA AAT TTT ATA 720 
Glu Cys Asn lie Asp Glu Val Lys Asn Val Tyr Phe Lys Asn Phe lie 
225 230 235 
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CCT TTT ATG AAT TCT CTT GGA CTT GTA ACA TCT AAT GGA CTT CCA GAG 768 
Pro Phe Met Asn Ser Leu Gly Leu Val Thr Ser Asn Gly Leu Pro Glu 
240 245 250 

GTT GAA AAT CTT TCT AAA CGA TAC GAA GAA ATT TAT CTT AAA AAT AAA 816 
Val Glu Asn Leu Ser Lys Arg Tyr Glu Glu lie Tyr Leu Lys Asn Lys 
255 260 265 270 

GAT CTA GAT GCA AGA TTA TTT TTG GAT CAT GAT AAA ACT CTT CAG ACT 864 
Asp Leu Asp Ala Arg Leu Phe Leu Asp His Asp Lys Thr Leu Gin Thr 
275 280 285 

GAT TCT ATA GAC AGT TTT GAA ACA CAG AGA ACA CCA CGA AAA AGT AAC 912 
Asp Ser lie Asp Ser Phe Glu Thr Gin Arg Thr Pro Arg Lys Ser Asn 
290 295 300 

CTT GAT GAA GAG GTG AAT GTA ATT CCT CCA CAC ACT CCA GTT AGG ACT 960 
Leu Asp Glu Glu Val Asn Val lie Pro Pro His Thr Pro Val Arg Thr 
305 310 315 

GTT ATG AAC ACT ATC CAA CAA TTA ATG ATG ATT TTA AAT TCA GCA AGT 1008 
Val Met Asn Thr lie Gin Gin Leu Met Met lie Leu Asn Ser Ala Ser 
320 325 330 

GAT CAA CCT TCA GAA AAT CTG ATT TCC TAT TTT AAC AAC TGC ACA GTG 10 56 

Asp Gin Pro Ser Glu Asn Leu lie Ser Tyr Phe Asn Asn Cys Thr Val 
335 340 345 350 

AAT CCA AAA GAA AGT ATA CTG AAA AGA GTG AAG GAT ATA GGA TAC ATC 1104 
Asn Pro Lys Glu Ser lie Leu Lys Arg Val Lys Asp lie Gly Tyr lie 
355 360 365 

TTT AAA GAG AAA TTT GCT AAA GCT GTG GGA CAG GGT TGT GTC GAA ATT 1152 
Phe Lys Glu Lys Phe Ala Lys Ala Val Gly Gin Gly Cys Val Glu lie 
370 375 380 

GGA TCA CAG CGA TAC AAA CTT GGA GTT CGC TTG TAT TAC CGA GTA ATG 12 00 

Gly Ser Gin Arg Tyr Lys Leu Gly Val Arg Leu Tyr Tyr Arg Val Met 
385 390 395 

GAA TCC ATG CTT AAA TCA GAA GAA GAA CGA TTA TCC ATT CAA AAT TTT 1248 
Glu Ser Met Leu Lys Ser Glu Glu Glu Arg Leu Ser lie Gin Asn Phe 
400 405 410 

AGC AAA CTT CTG AAT GAC AAC ATT TTT CAT ATG TCT TTA TTG GCG TGC 12 96 

Ser Lys Leu Leu Asn Asp Asn lie Phe His Met Ser Leu Leu Ala Cys 
415 420 425 430 

GCT CTT GAG GTT GTA ATG GCC ACA TAT AGC AGA AGT ACA TCT CAG AAT 1344 
Ala Leu Glu Val Val Met Ala Thr Tyr Ser Arg Ser Thr Ser Gin Asn 
435 440 445 
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CTT GAT TCT GGA ACA GAT TTG TCT TTC CCA TGG ATT CTG AAT GTG CTT 1392 
Leu Asp Ser Gly Thr Asp Leu Ser Phe Pro Trp He Leu Asn Val Leu 
450 455 460 

AAT TTA AAA GCC TTT GAT TTT TAC AAA GTG ATC GAA AGT TTT ATC AAA 144 0 

Asn Leu Lys Ala Phe Asp Phe Tyr Lys Val He Glu Ser Phe He Lys 
465 470 475 

GCA GAA GGC AAC TTG ACA AGA GAA ATG ATA AAA CAT TTA GAA CGA TGT 1488 
Ala Glu Gly Asn Leu Thr Arg Glu Met He Lys His Leu Glu Arg Cys 
480 485 490 

GAA CAT CGA ATC ATG GAA TCC CTT GCA TGG CTC TCA GAT TCA CCT TTA 1536 
Glu His Arg lie Met Glu Ser Leu Ala Trp Leu Ser Asp Ser Pro Leu 
495 500 505 510 

TTT GAT CTT ATT AAA CAA TCA AAG GAC CGA GAA GGA CCA ACT GAT CAC 1584 
Phe Asp Leu He Lys Gin Ser Lys Asp Arg Glu Gly Pro Thr Asp His 
515 520 525 

CTT GAA TCT GCT TGT CCT CTT AAT CTT CCT CTC CAG AAT AAT CAC ACT 1632 
Leu Glu Ser Ala Cys Pro Leu Asn Leu Pro Leu Gin Asn Asn His Thr 
530 535 540 

GCA GCA GAT ATG TAT CTT TCT CCT GTA AGA TCT CCA AAG AAA AAA GGT 1680 
Ala Ala Asp Met Tyr Leu Ser Pro Val Arg Ser Pro Lys Lys Lys Gly 
545 550 555 

TCA ACT ACG CGT GTA AAT TCT ACT GCA AAT GCA GAG ACA CAA GCA ACC 1728 
Ser Thr Thr Arg Val Asn Ser Thr Ala Asn Ala Glu Thr Gin Ala Thr 
560 565 570 

TCA GCC TTC CAG ACC CAG AAG CCA TTG AAA TCT ACC TCT CTT TCA CTG 1776 
Ser Ala Phe Gin Thr Gin Lys Pro Leu Lys Ser Thr Ser Leu Ser Leu 
575 580 585 590 

TTT TAT AAA AAA GTG TAT CGG CTA GCC TAT CTC CGG CTA AAT ACA CTT 1824 
Phe Tyr Lys Lys Val Tyr Arg Leu Ala Tyr Leu Arg Leu Asn Thr Leu 
595 600 605 

TGT GAA CGC CTT CTG TCT GAG CAC CCA GAA TTA GAA CAT ATC ATC TGG 1872 
Cys Glu Arg Leu Leu Ser Glu His Pro Glu Leu Glu His He He Trp 
610 615 620 

ACC CTT TTC CAG CAC ACC CTG CAG AAT GAG TAT GAA CTC ATG AGA GAC 1920 
Thr Leu Phe Gin His Thr Leu Gin Asn Glu Tyr Glu Leu Met Arg Asp 
625 630 635 

AGG CAT TTG GAC CAA ATT ATG ATG TGT TCC ATG TAT GGC ATA TGC AAA 1968 
Arg His Leu Asp Gin He Met Met Cys Ser Met Tyr Gly He Cys Lys 
^40 645 650 
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GTG AAG AAT ATA GAC CTT AAA TTC AAA ATC ATT GTA ACA GCA TAC AAG 2 016 

Val Lys Asn lie Asp Leu Lys Phe Lys lie lie Val Thr Ala Tyr Lys 
655 660 665 670 

GAT CTT CCT CAT GCT GTT CAG GAG ACA TTC AAA CGT GTT TTG ATC AAA 2 064 

Asp Leu Pro His Ala Val Gin Glu Thr Phe Lys Arg Val Leu lie Lys 
675 680 685 

GAA GAG GAG TAT GAT TCT ATT ATA GTA TTC TAT AAC TCG GTC TTC ATG 2112 
Glu Glu Glu Tyr Asp Ser lie lie Val Phe Tyr Asn Ser Val Phe Met 
690 695 700 

CAG AGA CTG AAA ACA AAT ATT TTG CAG TAT GCT TCC ACC AGG CCC CCT 216 0 

Gin Arg Leu Lys Thr Asn lie Leu Gin Tyr Ala Ser Thr Arg Pro Pro 
705 710 715 

ACC TTG TCA CCA ATA CCT CAC ATT CCT CGA AGC CCT TAC AAG TTT CCT 22 08 

Thr Leu Ser Pro lie Pro His lie Pro Arg Ser Pro Tyr Lys Phe Pro 
720 725 730 

AGT TCA CCC TTA CGG ATT CCT GGA GGG AAC ATC TAT ATT TCA CCC CTG 2256 
Ser Ser Pro Leu Arg lie Pro Gly Gly Asn lie Tyr lie Ser Pro Leu 
735 740 745 750 

AAG AGT CCA TAT AAA ATT TCA GAA GGT CTG CCA ACA CCA ACA AAA ATG 23 04 

Lys Ser Pro Tyr Lys lie Ser Glu Gly Leu Pro Thr Pro Thr Lys Met 
755 760 765 

ACT CCA AGA TCA AGA ATC TTA GTA TCA ATT GGT GAA TCA TTC GGG ACT 2352 
Thr Pro Arg Ser Arg lie Leu Val Ser lie Gly Glu Ser Phe Gly Thr 
770 775 780 

TCT GAG AAG TTC CAG AAA ATA AAT CAG ATG GTA TGT AAC AGC GAC CGT 24 00 

Ser Glu Lys Phe Gin Lys lie Asn Gin Met Val Cys Asn Ser Asp Arg 
785 790 795 

GTG CTC AAA AGA AGT GCT GAA GGA AGC AAC CCT CCT AAA CCA CTG AAA 2448 
Val Leu Lys Arg Ser Ala Glu Gly Ser Asn Pro Pro Lys Pro Leu Lys 
800 805 810 

AAA CTA CGC TTT GAT ATT GAA GGA TCA GAT GAA GCA GAT GGA AGT AAA 24 96 

Lys Leu Arg Phe Asp lie Glu Gly Ser Asp Glu Ala Asp Gly Ser Lys 
815 820 825 830 

CAT CTC CCA GGA GAG TCC AAA TTT CAG CAG AAA CTG GCA GAA ATG ACT 2544 
His Leu Pro Gly Glu Ser Lys Phe Gin Gin Lys Leu Ala Glu Met Thr 
835 840 845 

TCT ACT CGA ACA CGA ATG CAA AAG CAG AAA ATG AAT GAT AGC ATG GAT 2592 
Ser Thr Arg Thr Arg Met Gin Lys Gin Lys Met Asn Asp Ser Met Asp 
850 855 860 
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ACC TCA AAC AAG GAA GAG AAA TGAGGATCTC AGGACCTTGG TGGACACTGT 2 643 
Thr Ser Asn Lys Glu Glu Lys 
865 

GTACACCTCT GGATTCATTG TCTCTCACAG ATGTGACTGT ATAACTTTCC CAGGTTCTGT 2703 

TTATGGCCAC ATTTAATATC TTCAGCTCTT TTTGTGGATA TAAAATGTGC AGATGCAATT 2 763 

GTTTGGGTGA TTCCTAAGCC ACTTGAAATG TTAGTCATTG TTATTTATAC AAGATTGAAA 2 823 

ATCTTGTGTA AATCCTGCCA TTTAAAAAGT TGTAGCAGAT TGTTTCCTCT TCCAAAGTAA 2 883 

AATTGCTGTG CTTTATGGAT AGTAAGAATG GCCCTAGAGT GGGAGTCCTG ATAACCCAGG 2943 

CCTGTCTGAC TACTTTGCCT TCTTTTGTAG CATATAGGTG ATGTTTGCTC TTGTTTTTAT 3 003 

TAATTTATAT GTATATTTTT TTAATTTAAC ATGAACACCC TTAGAAAATG TGTCCTATCT 3 063 

ATCTTCCAAA TGCAATTTGA TTGACTGCCC ATTCACCAAA ATTATCCTGA ACTCTTCTGC 312 3 

AAAAATGGAT ATTATTAGAA ATTAGAAAAA AATTACTAAT TTTACACATT AGATTTTATT 3183 

TTACTATTGG AATCTGATAT ACTGTGTGCT TGTTTTATAA AATTTTGCTT TTAATTAAAT 3243 

AAAAGCTGGA AGCAAAGTAT AAC CATATGA TACTATCATA CTACTGAAAC AGATTTCATA 33 03 

CCTCAGAATG TAAAAGAACT TACTGATTAT TTTCTTCATC CAACTTATGT TTTTAAATGA 3 363 
GGATTATTGA TAGT 



(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 869 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

Met Pro Pro Lys Thr Pro Arg Lys Thr Ala Ala Thr Ala Ala Ala Ala 
1 5 10 15 

Ala Ala Glu Pro Pro Ala Pro Pro Pro Pro Pro Pro Pro Glu Glu Asp 
20 25 30 

Pro Glu Gin Asp Ser Gly Pro Glu Asp Leu Pro Leu Val Arg Leu Glu 
35 40 45 

Phe Glu Glu Thr Glu Glu Pro Asp Phe Thr Ala Leu Cys Gin Lys Leu 
50 55 60 
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Lys lie Pro Asp His Val Arg Glu Arg Ala Trp Leu Thr Trp Glu Lys 
65 70 75 80 



Val Ser Ser Val Asp Gly Val Leu Gly Gly Tyr lie Gin Lys Lys Lys 
85 90 95 

Glu Leu Trp Gly lie Cys lie Phe lie Ala Ala Val Asp Leu Asp Glu 
100 105 110 

Met Ser Phe Thr Phe Thr Glu Leu Gin Lys Asn lie Glu lie Ser Val 
115 120 125 

His Lys Phe Phe Asn Leu Leu Lys Glu lie Asp Thr Ser Thr Lys Val 
130 135 140 

Asp Asn Ala Met Ser Arg Leu Leu Lys Lys Tyr Asp Val Leu Phe Ala 
145 150 155 160 

Leu Phe Ser Lys Leu Glu Arg Thr Cys Glu Leu lie Tyr Leu Thr Gin 
165 170 175 

Pro Ser Ser Ser Met Val Ala Val lie Pro lie Asn Gly Ser Pro Arg 
180 185 190 

Thr Pro Arg Arg Gly Gin Asn Arg Ser Ala Arg lie Ala Lys Gin Leu 
195 200 205 

Glu Asn Asp Thr Arg lie lie Glu Val Leu Cys Lys Glu His Glu Cys 
210 215 220 

Asn lie Asp Glu Val Lys Asn Val Tyr Phe Lys Asn Phe lie Pro Phe 
225 230 235 240 

Met Asn Ser Leu Gly Leu Val Thr Ser Asn Gly Leu Pro Glu Val Glu 
245 250 255 

Asn Leu Ser Lys Arg Tyr Glu Glu lie Tyr Leu Lys Asn Lys Asp Leu 
260 265 270 

Asp Ala Arg Leu Phe Leu Asp His Asp Lys Thr Leu Gin Thr Asp Ser 
275 280 285 

lie Asp Ser Phe Glu Thr Gin Arg Thr Pro Arg Lys Ser Asn Leu Asp 
290 295 300 

Glu Glu Val Asn Val lie Pro Pro His Thr Pro Val Arg Thr Val Met 
305 310 315 320 

Asn Thr lie Gin Gin Leu Met Met lie Leu Asn Ser Ala Ser Asp Gin 
325 330 335 

Pro Ser Glu Asn Leu lie Ser Tyr Phe Asn Asn Cys Thr Val Asn Pro 
340 345 350 
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Lys Glu Ser lie Leu Lys Arg Val Lys Asp lie Gly Tyr lie Phe Lys 
355 360 365 



Glu Lys Phe Ala Lys Ala Val Gly Gin Gly Cys Val Glu lie Gly Ser 
370 375 380 

Gin Arg Tyr Lys Leu Gly Val Arg Leu Tyr Tyr Arg Val Met Glu Ser 
385 390 395 400 

Met Leu Lys Ser Glu Glu Glu Arg Leu Ser lie Gin Asn Phe Ser Lys 
405 410 415 

Leu Leu Asn Asp Asn lie Phe His Met Ser Leu Leu Ala Cys Ala Leu 
420 425 430 

Glu Val Val Met Ala Thr Tyr Ser Arg Ser Thr Ser Gin Asn Leu Asp 
435 440 445 

Ser Gly Thr Asp Leu Ser Phe Pro Trp lie Leu Asn Val Leu Asn Leu 
450 455 460 

Lys Ala Phe Asp Phe Tyr Lys Val lie Glu Ser Phe lie Lys Ala Glu 
465 470 475 480 

Gly Asn Leu Thr Arg Glu Met lie Lys His Leu Glu Arg Cys Glu His 
485 490 495 

Arg lie Met Glu Ser Leu Ala Trp Leu Ser Asp Ser Pro Leu Phe Asp 
500 505 510 

Leu lie Lys Gin Ser Lys Asp Arg Glu Gly Pro Thr Asp His Lea Glu 
515 520 525 

Ser Ala Cys Pro Leu Asn Leu Pro Leu Gin Asn Asn His Thr Ala Ala 
530 535 540 

Asp Met Tyr Leu Ser Pro Val Arg Ser Pro Lys Lys Lys Gly Ser Thr 
545 550 555 560 

Thr Arg Val Asn Ser Thr Ala Asn Ala Glu Thr Gin Ala Thr Ser Ala 
565 570 575 

Phe Gin Thr Gin Lys Pro Leu Lys Ser Thr Ser Leu Ser Leu Phe Tyr 
580 585 590 

Lys Lys Val Tyr Arg Leu Ala Tyr Leu Arg Leu Asn Thr Leu Cys Glu 
595 600 605 

Arg Leu Leu Ser Glu His Pro Glu Leu Glu His lie lie Trp Thr Leu 
610 615 620 

Phe Gin His Thr Leu Gin Asn Glu Tyr Glu Leu Met Arg Asp Arg His 
625 630 635 640 
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Leu Asp Gin lie Met Met Cys Ser Met Tyr Gly lie Cys Lys Val Lys 
645 650 655 



Asn lie Asp Leu Lys Phe Lys lie lie Val Thr Ala Tyr Lys Asp Leu 
660 665 670 

Pro His Ala Val Gin Glu Thr Phe Lys Arg Val Leu lie Lys Glu Glu 
675 680 685 

Glu Tyr Asp Ser lie lie Val Phe Tyr Asn Ser Val Phe Met Gin Arg 
690 695 700 

Leu Lys Thr Asn lie Leu Gin Tyr Ala Ser Thr Arg Pro Pro Thr Leu 
705 710 715 720 

Ser Pro lie Pro His lie Pro Arg Ser Pro Tyr Lys Phe Pro Ser Ser 
725 730 735 

Pro Leu Arg lie Pro Gly Gly Asn lie Tyr lie Ser Pro Leu Lys Ser 
740 745 750 

Pro Tyr Lys lie Ser Glu Gly Leu Pro Thr Pro Thr Lys Met Thr Pro 
755 760 765 

Arg Ser Arg lie Leu Val Ser lie Gly Glu Ser Phe Gly Thr Ser Glu 
770 775 780 

Lys Phe Gin Lys lie Asn Gin Met Val Cys Asn Ser Asp Arg Val Leu 
785 790 795 800 

Lys Arg Ser Ala Glu Gly Ser Asn Pro Pro Lys Pro Leu Lys Lys Leu 
805 810 815 

Arg Phe Asp He Glu Gly Ser Asp Glu Ala Asp Gly Ser Lys His Leu 
820 825 830 

Pro Gly Glu Ser Lys Phe Gin Gin Lys Leu Ala Glu Met Thr Ser Thr 
835 840 845 

Arg Thr Arg Met Gin Lys Gin Lys Met Asn Asp Ser Met Asp Thr Ser 
850 855 860 

Asn Lys Glu Glu Lys 
865 



(2) INFORMATION FOR SEQ ID NO:48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3383 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ix) FEATURE: 

(A) NAME / KEY : CDS 

(B) LOCATION: 7.. 2619 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:48: 

CGCGTC ATG CCG CCC AAA ACC CCC CGA AAA ACG GCC GCC ACC GCC GCC 48 
Met Pro Pro Lys Thr Pro Arg Lys Thr Ala Ala Thr Ala Ala 
15 10 

GCT GCC GCC GCG GAA CCC CCG GCA CCG CCG CCG CCG CCC CCT CCT GAG 96 

Ala Ala Ala Ala Glu Pro Pro Ala Pro Pro Pro Pro Pro Pro Pro Glu 

15 20 25 30 

GAG GAC CCA GAG CAG GAC AGC GGC CCG GAG GAC CTG CCT CTC GTC AGG 144 

Glu Asp Pro Glu Gin Asp Ser Gly Pro Glu Asp Leu Pro Leu Val Arg 

35 40 45 

CTT GAG TTT GAA GAA ACA GAA GAA CCT GAT TTT ACT GCA TTA TGT CAG 192 

Leu Glu Phe Glu Glu Thr Glu Glu Pro Asp Phe Thr Ala Leu Cys Gin 

50 55 60 

AAA TTA AAG ATA CCA GAT CAT GTC AGA GAG AGA GCT TGG TTA ACT TGG 240 

Lys Leu Lys lie Pro Asp His Val Arg Glu Arg Ala Trp Leu Thr Trp 
65 70 75 

GAG AAA GTT TCA TCT GTG GAT GGA GTA TTG GGA GGT TAT ATT CAA AAG 288 

Glu Lys Val Ser Ser Val Asp Gly Val Leu Gly Gly Tyr He Gin Lys 
80 85 90 

AAA AAG GAA CTG TGG GGA ATC TGT ATC TTT ATT GCA GCA GTT GAC CTA 336 

Lys Lys Glu Leu Trp Gly He Cys He Phe He Ala Ala Val Asp Leu 

95 100 105 110 

GAT GAG ATG TCG TTC ACT TTT ACT GAG CTA CAG AAA AAC ATA GAA ATC 3 84 

Asp Glu Met Ser Phe Thr Phe Thr Glu Leu Gin Lys Asn He Glu He 

115 120 125 

AGT GTC CAT AAA TTC TTT AAC TTA CTA AAA GAA ATT GAT ACC AGT ACC 432 

Ser Val His Lys Phe Phe Asn Leu Leu Lys Glu He Asp Thr Ser Thr 

130 135 140 

AAA GTT GAT AAT GCT ATG TCA AGA CTG TTG AAG AAG TAT GAT GTA TTG 48 0 

Lys Val Asp Asn Ala Met Ser Arg Leu Leu Lys Lys Tyr Asp Val Leu 
145 150 155 

TTT GCA CTC TTC AGC AAA TTG GAA AGG ACA TGT GAA CTT ATA TAT TTG 528 

Phe Ala Leu Phe Ser Lys Leu Glu Arg Thr Cys Glu Leu He Tyr Leu 
160 165 170 

ACA CAA CCC AGC AGT TCG ATA TCT ACT GAA ATA AAT TCT GCA TTG GTG 576 

Thr Gin Pro Ser Ser Ser He Ser Thr Glu He Asn Ser Ala Leu Val 

175 180 185 190 
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CTA AAA GTT TCT TGG ATC ACA TTT TTA TTA GCT AAA GGG GAA GTA TTA 624 
Leu Lys Val Ser Trp lie Thr Phe Leu Leu Ala Lys Gly Glu Val Leu 
195 200 205 

CAA ATG GAA GAT GAT CTG GTG ATT TCA TTT CAG TTA ATG CTA TGT GTC 672 
Gin Met Glu Asp Asp Leu Val lie Ser Phe Gin Leu Met Leu Cys Val 
210 215 220 

CTT GAC TAT TTT ATT AAA CTC TCA CCT CCC ATG TTG CTC AAA GAA CCA 720 
Leu Asp Tyr Phe lie Lys Leu Ser Pro Pro Met Leu Leu Lys Glu Pro 
225 230 235 

TAT AAA ACA GGG TCG AAT TCT CTT GGA CTT GTA ACA TCT AAT GGA CTT 768 
Tyr Lys Thr Gly Ser Asn Ser Leu Gly Leu Val Thr Ser Asn Gly Leu 
240 245 250 

CCA GAG GTT GAA AAT CTT TCT AAA CGA TAC GAA GAA ATT TAT CTT AAA 816 
Pro Glu Val Glu Asn Leu Ser Lys Arg Tyr Glu Glu lie Tyr Leu Lys 
255 260 , 265 270 

AAT AAA GAT CTA GAT GCA AGA TTA TTT TTG GAT CAT GAT AAA ACT CTT 864 
Asn Lys Asp Leu Asp Ala Arg Leu Phe Leu Asp His Asp Lys Thr Leu 
275 280 285 

CAG ACT GAT TCT ATA GAC AGT TTT GAA ACA CAG AGA ACA CCA CGA AAA 912 
Gin Thr Asp Ser He Asp Ser Phe Glu Thr Gin Arg Thr Pro Arg Lys 
290 295 300 

AGT AAC CTT GAT GAA GAG GTG AAT GTA ATT CCT CCA CAC ACT CCA GTT 960 
Ser Asn Leu Asp Glu Glu Val Asn Val He Pro Pro His Thr Pro Val 
305 310 315 

AGG ACT GTT ATG AAC ACT ATC CAA CAA TTA ATG ATG ATT TTA AAT TCA 1008 
Arg Thr Val Met Asn Thr He Gin Gin Leu Met Met He Leu Asn Ser 
320 325 330 

GCA AGT GAT CAA CCT TCA GAA AAT CTG ATT TCC TAT TTT AAC AAC TGC 1056 
Ala Ser Asp Gin Pro Ser Glu Asn Leu He Ser Tyr Phe Asn Asn Cys 
335 340 345 350 

ACA GTG AAT CCA AAA GAA AGT ATA CTG AAA AGA GTG AAG GAT ATA GGA 1104 
Thr Val Asn Pro Lys Glu Ser He Leu Lys Arg Val Lys Asp He Gly 
355 360 365 

TAC ATC TTT AAA GAG AAA TTT GCT AAA GCT GTG GGA CAG GGT TGT GTC 1152 
Tyr He Phe Lys Glu Lys Phe Ala Lys Ala Val Gly Gin Gly Cys Val 
370 375 380 

GAA ATT GGA TCA CAG CGA TAC AAA CTT GGA GTT CGC TTG TAT TAC CGA 1200 
Glu He Gly Ser Gin Arg Tyr Lys Leu Gly Val Arg Leu Tyr Tyr Arg 
385 390 395 
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GTA ATG GAA TCC ATG CTT AAA TCA GAA GAA GAA CGA TTA TCC ATT CAA 1248 
Val Met Glu Ser Met Leu Lys Ser Glu Glu Glu Arg Leu Ser lie Gin 
400 405 410 

AAT TTT AGC AAA CTT CTG AAT GAC AAC ATT TTT CAT ATG TCT TTA TTG 12 96 

Asn Phe Ser Lys Leu Leu Asn Asp Asn lie Phe His Met Ser Leu Leu 
415 420 425 430 

GCG TGC GCT CTT GAG GTT GTA ATG GCC ACA TAT AGC AGA AGT ACA TCT 1344 
Ala Cys Ala Leu Glu Val Val Met Ala Thr Tyr Ser Arg Ser Thr Ser 
435 440 445 

CAG AAT CTT GAT TCT GGA ACA GAT TTG TCT TTC CCA TGG ATT CTG AAT 13 92 

Gin Asn Leu Asp Ser Gly Thr Asp Leu Ser Phe Pro Trp lie Leu Asn 
450 455 460 

GTG CTT AAT TTA AAA GCC TTT GAT TTT TAC AAA GTG ATC GAA AGT TTT 144 0 

Val Leu Asn Leu Lys Ala Phe Asp Phe Tyr Lys Val lie Glu Ser Phe 
465 470 475 

ATC AAA GCA GAA GGC AAC TTG ACA AGA GAA ATG ATA AAA CAT TTA GAA 1488 
lie Lys Ala Glu Gly Asn Leu Thr Arg Glu Met lie Lys His Leu Glu 
480 485 490 

CGA TGT GAA CAT CGA ATC ATG GAA TCC CTT GCA TGG CTC TCA GAT TCA 1536 
Arg Cys Glu His Arg lie Met Glu Ser Leu Ala Trp Leu Ser Asp Ser 
495 500 505 510 

CCT TTA TTT GAT CTT ATT AAA CAA TCA AAG GAC CGA GAA GGA CCA ACT 1584 
Pro Leu Phe Asp Leu lie Lys Gin Ser Lys Asp Arg Glu Gly Pro Thr 
515 520 525 

GAT CAC CTT GAA TCT GCT TGT CCT CTT AAT CTT CCT CTC CAG AAT AAT 1632 
Asp His Leu Glu Ser Ala Cys Pro Leu Asn Leu Pro Leu Gin Asn Asn 
530 535 540 

CAC ACT GCA GCA GAT ATG TAT CTT TCT CCT GTA AGA TCT CCA AAG AAA 1680 
His Thr Ala Ala Asp Met Tyr Leu Ser Pro Val Arg Ser Pro Lys Lys 
545 550 555 

AAA GGT TCA ACT ACG CGT GTA AAT TCT ACT GCA AAT GCA GAG ACA CAA 172 8 

Lys Gly Ser Thr Thr Arg Val Asn Ser Thr Ala Asn Ala Glu Thr Gin 
560 565 570 

GCA ACC TCA GCC TTC CAG ACC CAG AAG CCA TTG AAA TCT ACC TCT CTT 1776 
Ala Thr Ser Ala Phe Gin Thr Gin Lys Pro Leu Lys Ser Thr Ser Leu 
575 580 585 590 

TCA CTG TTT TAT AAA AAA GTG TAT CGG CTA GCC TAT CTC CGG CTA AAT 1824 
Ser Leu Phe Tyr Lys Lys Val Tyr Arg Leu Ala Tyr Leu Arg Leu Asn 
595 600 605 
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ACA CTT TGT GAA CGC CTT CTG TCT GAG CAC CCA GAA TTA GAA CAT ATC 18 72 

Thr Leu Cys Glu Arg Leu Leu Ser Glu His Pro Glu Leu Glu His lie 
610 615 620 

ATC TGG ACC CTT TTC CAG CAC ACC CTG CAG AAT GAG TAT GAA CTC ATG 192 0 

lie Trp Thr Leu Phe Gin His Thr Leu Gin Asn Glu Tyr Glu Leu Met 
625 630 635 

AGA GAC AGG CAT TTG GAC CAA ATT ATG ATG TGT TCC ATG TAT GGC ATA 1968 
Arg Asp Arg His Leu Asp Gin lie Met Met Cys Ser Met Tyr Gly lie 
640 645 650 

TGC AAA GTG AAG AAT ATA GAC CTT AAA TTC AAA ATC ATT GTA ACA GCA 2 016 

Cys Lys Val Lys Asn lie Asp Leu Lys Phe Lys lie lie Val Thr Ala 
655 660 665 670 

TAC AAG GAT CTT CCT CAT GCT GTT CAG GAG ACA TTC AAA CGT GTT TTG 2 064 

Tyr Lys Asp Leu Pro His Ala Val Gin Glu Thr Phe Lys Arg Val Leu 
675 680 685 

ATC AAA GAA GAG GAG TAT GAT TCT ATT ATA GTA TTC TAT AAC TCG GTC 2112 
lie Lys Glu Glu Glu Tyr Asp Ser He He Val Phe Tyr Asn Ser Val 
690 695 700 

TTC ATG CAG AGA CTG AAA ACA AAT ATT TTG CAG TAT GCT TCC ACC AGG 2160 
Phe Met Gin Arg Leu Lys Thr Asn He Leu Gin Tyr Ala Ser Thr Arg 
705 710 715 

CCC CCT ACC TTG TCA CCA ATA CCT CAC ATT CCT CGA AGC CCT TAC AAG 2208 
Pro Pro Thr Leu Ser Pro He Pro His He Pro Arg Ser Pro Tyr Lys 
720 725 730 

TTT CCT AGT TCA CCC TTA CGG ATT CCT GGA GGG AAC ATC TAT ATT TCA 22 56 

Phe Pro Ser Ser Pro Leu Arg He Pro Gly Gly Asn He Tyr He Ser 
735 740 745 750 

CCC CTG AAG AGT CCA TAT AAA ATT TCA GAA GGT CTG CCA ACA CCA ACA 2304 
Pro Leu Lys Ser Pro Tyr Lys He Ser Glu Gly Leu Pro Thr Pro Thr 
755 760 765 

AAA ATG ACT CCA AGA TCA AGA ATC TTA GTA TCA ATT GGT GAA TCA TTC 2352 
Lys Met Thr Pro Arg Ser Arg He Leu Val Ser He Gly Glu Ser Phe 
770 775 780 

GGG ACT TCT GAG AAG TTC CAG AAA ATA AAT CAG ATG GTA TGT AAC AGC 2400 
Gly Thr Ser Glu Lys Phe Gin Lys He Asn Gin Met Val Cys Asn Ser 
785 790 795 

GAC CGT GTG CTC AAA AGA AGT GCT GAA GGA AGC AAC CCT CCT AAA CCA 2448 
Asp Arg Val Leu Lys Arg Ser Ala Glu Gly Ser Asn Pro Pro Lys Pro 
800 805 810 
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CTG AAA AAA CTA CGC TTT GAT ATT GAA GGA TCA GAT GAA GCA GAT GGA 24 96 
Leu Lys Lys Leu Arg Phe Asp lie Glu Gly Ser Asp Glu Ala Asp Gly 
815 820 825 830 

AGT AAA CAT CTC CCA GGA GAG TCC AAA TTT CAG CAG AAA CTG GCA GAA 2 544 
Ser Lys His Leu Pro Gly Glu Ser Lys Phe Gin Gin Lys Leu Ala Glu 
835 840 845 

ATG ACT TCT ACT CGA ACA CGA ATG CAA AAG CAG AAA ATG AAT GAT AGC 2 5 92 
Met Thr Ser Thr Arg Thr Arg Met Gin Lys Gin Lys Met Asn Asp Ser 
850 855 860 

ATG GAT ACC TCA AAC AAG GAA GAG AAA TGAGGATCTC AGGACCTTGG 263 9 
Met Asp Thr Ser Asn Lys Glu Glu Lys 
865 870 

TGGACACTGT GTACACCTCT GGATTCATTG TCTCTCACAG ATGTGACTGT ATAACTTTCC 2 699 

CAGGTTCTGT TTATGGCCAC ATTTAATATC TTCAGCTCTT TTTGTGGATA TAAAATGTGC 275 9 

AGATGCAATT GTTTGGGTGA TTCCTAAGCC ACTTGAAATG TTAGTCATTG TTATTTATAC 2 819 

AAGATTGAAA ATCTTGTGTA AATCCTGCCA TTTAAAAAGT TGTAGCAGAT TGTTTCCTCT 2879 

TCCAAAGTAA AATTGCTGTG CTTTATGGAT AGTAAGAATG GCCCTAGAGT GGGAGTCCTG 2 93 9 

ATAACCCAGG CCTGTCTGAC TACTTTGCCT TCTTTTGTAG CATATAGGTG ATGTTTGCTC 2 999 

TTGTTTTTAT TAATTTATAT GTATATTTTT TTAATTTAAC ATGAACACCC TTAGAAAATG 3 05 9 

TGTCCTATCT ATCTTCCAAA TGCAATTTGA TTGACTGCCC ATTCACCAAA ATTATCCTGA 3119 

ACTCTTCTGC AAAAATGGAT ATTATTAGAA ATTAGAAAAA AATTACTAAT TTTACACATT 317 9 

AGATTTTATT TTACTATTGG AATCTGATAT ACTGTGTGCT TGTTTTATAA AATTTTGCTT 32 39 

TTAATTAAAT AAAAGCTGGA AGCAAAGTAT AACCATATGA TACTATCATA CTACTGAAAC 32 99 

AGATTTCATA CCTCAGAATG TAAAAGAACT TACTGATTAT TTTCTTCATC CAACTTATGT 33 59 

TTTTAAATGA GGATTATTGA TAGT 3 3 83 



(2) INFORMATION FOR SEQ ID NO:49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 871 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 
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Met Pro Pro Lys Thr Pro Arg Lys Thr Ala Ala Thr Ala Ala Ala Ala 
15 10 15 



Ala Ala Glu Pro Pro Ala Pro Pro Pro Pro Pro Pro Pro Glu Glu Asp 
20 25 30 

Pro Glu Gin Asp Ser Gly Pro Glu Asp Leu Pro Leu Val Arg Leu Glu 
35 40 45 

Phe Glu Glu Thr Glu Glu Pro Asp Phe Thr Ala Leu Cys Gin Lys Leu 
50 55 60 

Lys lie Pro Asp His Val Arg Glu Arg Ala Trp Leu Thr Trp Glu Lys 
65 70 75 80 

Val Ser Ser Val Asp Gly Val Leu Gly Gly Tyr lie Gin Lys Lys Lys 
85 90 95 

Glu Leu Trp Gly lie Cys lie Phe lie Ala Ala Val Asp Leu Asp Glu 
100 105 110 

Met Ser Phe Thr Phe Thr Glu Leu Gin Lys Asn lie Glu lie Ser Val 
115 120 125 

His Lys Phe Phe Asn Leu Leu Lys Glu lie Asp Thr Ser Thr Lys Val 
130 135 140 

Asp Asn Ala Met Ser Arg Leu Leu Lys Lys Tyr Asp Val Leu Phe Ala 
145 150 155 160 

Leu Phe Ser Lys Leu Glu Arg Thr Cys Glu Leu lie Tyr Leu Thr Gin 
165 170 175 

Pro Ser Ser Ser lie Ser Thr Glu lie Asn Ser Ala Leu Val Leu Lys 
180 185 190 

Val Ser Trp lie Thr Phe Leu Leu Ala Lys Gly Glu Val Leu Gin Met 
195 200 205 

Glu Asp Asp Leu Val lie Ser Phe Gin Leu Met Leu Cys Val Leu Asp 
210 215 220 

Tyr Phe lie Lys Leu Ser Pro Pro Met Leu Leu Lys Glu Pro Tyr Lys 
225 230 235 240 

Thr Gly Ser Asn Ser Leu Gly Leu Val Thr Ser Asn Gly Leu Pro Glu 
245 250 255 

Val Glu Asn Leu Ser Lys Arg Tyr Glu Glu lie Tyr Leu Lys Asn Lys 
260 265 270 

Asp Leu Asp Ala Arg Leu Phe Leu Asp His Asp Lys Thr Leu Gin Thr 
275 280 285 
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Asp Ser lie Asp Ser Phe Glu Thr Gin Arg Thr Pro Arg Lys Ser Asn 
290 295 300 



Leu Asp Glu Glu Val Asn Val lie Pro Pro His Thr Pro Val Arg Thr 
305 310 315 320 

Val Met Asn Thr lie Gin Gin Leu Met Met He Leu Asn Ser Ala Ser 
325 330 335 

Asp Gin Pro Ser Glu Asn Leu He Ser Tyr Phe Asn Asn Cys Thr Val 
340 345 350 

Asn Pro Lys Glu Ser He Leu Lys Arg Val Lys Asp He Gly Tyr He 
355 360 365 

Phe Lys Glu Lys Phe Ala Lys Ala Val Gly Gin Gly Cys Val Glu He 
370 375 380 

Gly Ser Gin Arg Tyr Lys Leu Gly Val Arg Leu Tyr Tyr Arg Val Met 
385 390 395 400 

Glu Ser Met Leu Lys Ser Glu Glu Glu Arg Leu Ser He Gin Asn Phe 
405 410 415 

Ser Lys Leu Leu Asn Asp Asn He Phe His Met Ser Leu Leu Ala Cys 
420 425 430 

Ala Leu Glu Val Val Met Ala Thr Tyr Ser Arg Ser Thr Ser Gin Asn 
435 440 445 

Leu Asp Ser Gly Thr Asp Leu Ser Phe Pro Trp He Leu Asn Val Leu 
450 455 460 

Asn Leu Lys Ala Phe Asp Phe Tyr Lys Val He Glu Ser Phe He Lys 
465 470 475 480 

Ala Glu Gly Asn Leu Thr Arg Glu Met He Lys His Leu Glu Arg Cys 
485 490 495 

Glu His Arg He Met Glu Ser Leu Ala Trp Leu Ser Asp Ser Pro Leu 
500 505 510 

Phe Asp Leu He Lys Gin Ser Lys Asp Arg Glu Gly Pro Thr Asp His 
515 520 525 

Leu Glu Ser Ala Cys Pro Leu Asn Leu Pro Leu Gin Asn Asn His Thr 
530 535 540 

Ala Ala Asp Met Tyr Leu Ser Pro Val Arg Ser Pro Lys Lys Lys Gly 
545 550 555 560 

Ser Thr Thr Arg Val Asn Ser Thr Ala Asn Ala Glu Thr Gin Ala Thr 
565 570 575 
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Ser Ala Phe Gin Thr Gin Lys Pro Leu Lys Ser Thr Ser Leu Ser Leu 
580 585 590 



Phe Tyr Lys Lys Val Tyr Arg Leu Ala Tyr Leu Arg Leu Asn Thr Leu 
595 600 605 

Cys Glu Arg Leu Leu Ser Glu His Pro Glu Leu Glu His lie He Trp 
610 615 620 

Thr Leu Phe Gin His Thr Leu Gin Asn Glu Tyr Glu Leu Met Arg Asp 
625 630 635 640 

Arg His Leu Asp Gin He Met Met Cys Ser Met Tyr Gly He Cys Lys 
645 650 655 

Val Lys Asn He Asp Leu Lys Phe Lys He He Val Thr Ala Tyr Lys 
660 665 670 

Asp Leu Pro His Ala Val Gin Glu Thr Phe Lys Arg Val Leu He Lys 
675 680 685 

Glu Glu Glu Tyr Asp Ser He He Val Phe Tyr Asn Ser Val Phe Met 
690 695 700 

Gin Arg Leu Lys Thr Asn He Leu Gin Tyr Ala Ser Thr Arg Pro Pro 
705 710 715 720 

Thr Leu Ser Pro lie Pro His He Pro Arg Ser Pro Tyr Lys Phe Pro 
725 730 735 

Ser Ser Pro Leu Arg He Pro Gly Gly Asn He Tyr He Ser Pro Leu 
740 745 750 

Lys Ser Pro Tyr Lys He Ser Glu Gly Leu Pro Thr Pro Thr Lys Met 
755 760 765 

Thr Pro Arg Ser Arg He Leu Val Ser He Gly Glu Ser Phe Gly Thr 
770 775 780 

Ser Glu Lys Phe Gin Lys He Asn Gin Met Val Cys Asn Ser Asp Arg 
785 790 795 800 

Val Leu Lys Arg Ser Ala Glu Gly Ser Asn Pro Pro Lys Pro Leu Lys 
805 810 815 

Lys Leu Arg Phe Asp He Glu Gly Ser Asp Glu Ala Asp Gly Ser Lys 
820 825 830 

His Leu Pro Gly Glu Ser Lys Phe Gin Gin Lys Leu Ala Glu Met Thr 
835 840 845 

Ser Thr Arg Thr Arg Met Gin Lys Gin Lys Met Asn Asp Ser Met Asp 
850 855 860 
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Thr Ser Asn Lys Glu Glu Lys 
865 870 



(2) INFORMATION FOR SEQ ID NO: 50: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3554 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 7.. 2790 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:50: 



CGCGTC ATG CCG CCC AAA ACC CCC CGA AAA ACG GCC GCC ACC GCC GCC 48 
Met Pro Pro Lys Thr Pro Arg Lys Thr Ala Ala Thr Ala Ala 
15 10 

GCT GCC GCC GCG GAA CCC CCG GCA CCG CCG CCG CCG CCC CCT CCT GAG 96 
Ala Ala Ala Ala Glu Pro Pro Ala Pro Pro Pro Pro Pro Pro Pro Glu 
15 20 25 30 

GAG GAC CCA GAG CAG GAC AGC GGC CCG GAG GAC CTG CCT CTC GTC AGG 144 
Glu Asp Pro Glu Gin Asp Ser Gly Pro Glu Asp Leu Pro Leu Val Arg 
35 40 45 

CTT GAG TTT GAA GAA ACA GAA GAA CCT GAT TTT ACT GCA TTA TGT CAG 192 
Leu Glu Phe Glu Glu Thr Glu Glu Pro Asp Phe Thr Ala Leu Cys Gin 
50 55 60 

AAA TTA AAG ATA CCA GAT CAT GTC AGA GAG AGA GCT TGG TTA ACT TGG 24 0 

Lys Leu Lys lie Pro Asp His Val Arg Glu Arg Ala Trp Leu Thr Trp 
65 70 75 

GAG AAA GTT TCA TCT GTG GAT GGA GTA TTG GGA GGT TAT ATT CAA AAG 2 88 

Glu Lys Val Ser Ser Val Asp Gly Val Leu Gly Gly Tyr He Gin Lys 
80 85 90 



AAA AAG GAA CTG TGG GGA ATC TGT ATC TTT ATT GCA GCA GTT GAC CTA 33 6 

Lys Lys Glu Leu Trp Gly He Cys He Phe He Ala Ala Val Asp Leu 
35 100 105 110 

GGT GAT ATG TCG TTC ACT TTT ACT GAG CTA CAG AAA AAC ATA GAA ATC 384 
Gly Asp Met Ser Phe Thr Phe Thr Glu Leu Gin Lys Asn He Glu He 
115 120 125 

AGT GTC CAT AAA TTC TTT AAC TTA CTA AAA GAA ATT GAT ACC AGT ACC 432 
Ser Val His Lys Phe Phe Asn Leu Leu Lys Glu He Asp Thr Ser Thr 
130 135 140 
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AAA GTT GAT AAT GCT ATG TCA AGA CTG TTG AAG AAG TAT GAT GTA TTG 480 

Lys Val Asp Asn Ala Met Ser Arg Leu Leu Lys Lys Tyr Asp Val Leu 
145 150 155 

TTT GCA CTC TTC AGC AAA TTG GAA AGG ACA TGT GAA CTT ATA TAT TTG 52 8 

Phe Ala Leu Phe Ser Lys Leu Glu Arg Thr Cys Glu Leu lie Tyr Leu 

160 165 170 

ACA CAA CCC AGC AGT TCG ATA TCT ACT GAA ATA AAT TCT GCA TTG GTG 576 

Thr Gin Pro Ser Ser Ser He Ser Thr Glu He Asn Ser Ala Leu Val 
175 180 185 190 

CTA AAA GTT TCT TGG ATC ACA TTT TTA TTA GCT AAA GGG GAA GTA TTA 624 

Leu Lys Val Ser Trp He Thr Phe Leu Leu Ala Lys Gly Glu Val Leu 
195 200 205 

CAA ATG GAA GAT GAT CTG GTG ATT TCA TTT CAG TTA ATG CTA TGT GTC 672 

Gin Met Glu Asp Asp Leu Val He Ser Phe Gin Leu Met Leu Cys Val 
210 215 220 

CTT GAC TAT TTT ATT AAA CTC TCA CCT CCC ATG TTG CTC AAA GAA CCA 72 0 

Leu Asp Tyr Phe He Lys Leu Ser Pro Pro Met Leu Leu Lys Glu Pro 
225 230 235 

TAT AAA ACA GCT GTT ATA CCC ATT AAT GGT TCA CCT CGA ACA CCC AGG 768 

Tyr Lys Thr Ala Val He Pro He Asn Gly Ser Pro Arg Thr Pro Arg 

240 245 250 

CGA GGT CAG AAC AGG AGT GCA CGG ATA GCA AAA CAA CTA GAA AAT GAT 816 

Arg Gly Gin Asn Arg Ser Ala Arg He Ala Lys Gin Leu Glu Asn Asp 
255 260 265 270 

ACA AGA ATT ATT GAA GTT CTC TGT AAA GAA CAT GAA TGT AAT ATA GAT 864 

Thr Arg He He Glu Val Leu Cys Lys Glu His Glu Cys Asn He Asp 
275 280 285 

GAG GTG AAA AAT GTT TAT TTC AAA AAT TTT ATA CCT TTT ATG AAT TCT 912 

Glu Val Lys Asn t Val Tyr Phe Lys Asn Phe He Pro Phe Met Asn Ser 
290 295 300 

CTT GGA CTT GTA ACA TCT AAT GGA CTT CCA GAG GTT GAA AAT CTT TCT 960 

Leu Gly Leu Val Thr Ser Asn Gly Leu Pro Glu Val Glu Asn Leu Ser 
305 310 315 

AAA CGA TAC GAA GAA ATT TAT CTT AAA AAT AAA GAT CTA GAT GCA AGA 1008 

Lys Arg Tyr Glu Glu He Tyr Leu Lys Asn Lys Asp Leu Asp Ala Arg 

320 325 330 

TTA TTT TTG GAT CAT GAT AAA ACT CTT CAG ACT GAT TCT ATA GAC AGT 1056 

Leu Phe Leu Asp His Asp Lys Thr Leu Gin Thr Asp Ser He Asp Ser 
335 340 345 350 



-240- 



TTT GAA ACA CAG AGA ACA CCA CGA AAA AGT AAC CTT GAT GAA GAG GTG 1104 
Phe Glu Thr Gin Arg Thr Pro Arg Lys Ser Asn Leu Asp Glu Glu Val 
355 360 365 

AAT GTA ATT CCT CCA CAC ACT CCA GTT AGG ACT GTT ATG AAC ACT ATC 1152 
Asn Val lie Pro Pro His Thr Pro Val Arg Thr Val Met Asn Thr lie 
370 375 380 

CAA CAA TTA ATG ATG ATT TTA AAT TCA GCA AGT GAT CAA CCT TCA GAA 12 00 

Gin Gin Leu Met Met lie Leu Asn Ser Ala Ser Asp Gin Pro Ser Glu 
385 390 395 

AAT CTG ATT TCC TAT TTT AAC AAC TGC ACA GTG AAT CCA AAA GAA AGT 1248 
Asn Leu lie Ser Tyr Phe Asn Asn Cys Thr Val Asn Pro Lys Glu Ser 
400 405 410 

ATA CTG AAA AGA GTG AAG GAT ATA GGA TAC ATC TTT AAA GAG AAA TTT 1296 
lie Leu Lys Arg Val Lys Asp lie Gly Tyr lie Phe Lys Glu Lys Phe 
415 420 425 430 

GCT AAA GCT GTG GGA CAG GGT TGT GTC GAA ATT GGA TCA CAG CGA TAC 1344 
Ala Lys Ala Val Gly Gin Gly Cys Val Glu He Gly Ser Gin Arg Tyr 
435 440 445 

AAA CTT GGA GTT CGC TTG TAT TAC CGA GTA ATG GAA TCC ATG CTT AAA 13 92 

Lys Leu Gly Val Arg Leu Tyr Tyr Arg Val Met Glu Ser Met Leu Lys 
450 455 460 

TCA GAA GAA GAA CGA TTA TCC ATT CAA AAT TTT AGC AAA CTT CTG AAT 1440 
Ser Glu Glu Glu Arg Leu Ser He Gin Asn Phe Ser Lys Leu Leu Asn 
465 470 475 

GAC AAC ATT TTT CAT ATG TCT TTA TTG GCG TGC GCT CTT GAG GTT GTA 14 88 

Asp Asn He Phe His Met Ser Leu Leu Ala Cys Ala Leu Glu Val Val 
480 485 490 

ATG GCC ACA TAT AGC AGA AGT ACA TCT CAG AAT CTT GAT TCT GGA ACA 153 6 

Met Ala Thr Tyr Ser Arg Ser Thr Ser Gin Asn Leu Asp Ser Gly Thr 
495 500 505 510 

GAT TTG TCT TTC CCA TGG ATT CTG AAT GTG CTT AAT TTA AAA GCC TTT 1584 
Asp Leu Ser Phe Pro Trp He Leu Asn Val Leu Asn Leu Lys Ala Phe 
515 520 525 

GAT TTT TAC AAA GTG ATC GAA AGT TTT ATC AAA GCA GAA GGC AAC TTG 1632 
Asp Phe Tyr Lys Val He Glu Ser Phe He Lys Ala Glu Gly Asn Leu 
530 535 540 

ACA AGA GAA ATG ATA AAA CAT TTA GAA CGA TGT GAA CAT CGA ATC ATG 1680 
Thr Arg Glu Met He Lys His Leu Glu Arg Cys Glu His Arg He Met 
545 550 555 
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GAA TCC CTT GCA TGG CTC TCA GAT TCA CCT TTA TTT GAT CTT ATT AAA 172 8 

Glu Ser Leu Ala Trp Leu Ser Asp Ser Pro Leu Phe Asp Leu lie Lys 
560 565 570 

CAA TCA AAG GAC CGA GAA GGA CCA ACT GAT CAC CTT GAA TCT GCT TGT 1776 
Gin Ser Lys Asp Arg Glu Gly Pro Thr Asp His Leu Glu Ser Ala Cys 
575 580 585 590 

CCT CTT AAT CTT CCT CTC CAG AAT AAT CAC ACT GCA GCA GAT ATG TAT 1824 
Pro Leu Asn Leu Pro Leu Gin Asn Asn His Thr Ala Ala Asp Met Tyr 
595 600 605 

CTT TCT CCT GTA AGA TCT CCA AAG AAA AAA GGT TCA ACT ACG CGT GTA 18 72 

Leu Ser Pro Val Arg Ser Pro Lys Lys Lys Gly Ser Thr Thr Arg Val 
610 615 620 

AAT TCT ACT GCA AAT GCA GAG ACA CAA GCA ACC TCA GCC TTC CAG ACC 192 0 

Asn Ser Thr Ala Asn Ala Glu Thr Gin Ala Thr Ser Ala Phe Gin Thr 
625 630 635 

CAG AAG CCA TTG AAA TCT ACC TCT CTT TCA CTG TTT TAT AAA AAA GTG 1968 
Gin Lys Pro Leu Lys Ser Thr Ser Leu Ser Leu Phe Tyr Lys Lys Val 
640 645 650 

TAT CGG CTA GCC TAT CTC CGG CTA AAT ACA CTT TGT GAA CGC CTT CTG 2 016 

Tyr Arg Leu Ala Tyr Leu Arg Leu Asn Thr Leu Cys Glu Arg Leu Leu 
655 660 665 670 

TCT GAG CAC CCA GAA TTA GAA CAT ATC ATC TGG ACC CTT TTC CAG CAC 2 064 

Ser Glu His Pro Glu Leu Glu His lie lie Trp Thr Leu Phe Gin His 
675 680 685 

ACC CTG CAG AAT GAG TAT GAA CTC ATG AGA GAC AGG CAT TTG GAC CAA 2112 
Thr Leu Gin Asn Glu Tyr Glu Leu Met Arg Asp Arg His Leu Asp Gin 
690 695 700 

ATT ATG ATG TGT TCC ATG TAT GGC ATA TGC AAA GTG AAG AAT ATA GAC 2160 
He Met Met Cys Ser Met Tyr Gly He Cys Lys Val Lys Asn He Asp 
705 710 715 

CTT AAA TTC AAA ATC ATT GTA ACA GCA TAC AAG GAT CTT CCT CAT GCT 22 08 

Leu Lys Phe Lys He He Val Thr Ala Tyr Lys Asp Leu Pro His Ala 
720 725 730 

GTT CAG GAG ACA TTC AAA CGT GTT TTG ATC AAA GAA GAG GAG TAT GAT 2256 
Val Gin Glu Thr Phe Lys Arg Val Leu He Lys Glu Glu Glu Tyr Asp 
735 740 745 750 

TCT ATT ATA GTA TTC TAT AAC TCG GTC TTC ATG CAG AGA CTG AAA ACA 2304 
Ser He He Val Phe Tyr Asn Ser Val Phe Met Gin Arg Leu Lys Thr 
755 760 765 
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AAT ATT TTG CAG TAT GCT TCC ACC AGG CCC CCT ACC TTG TCA CCA ATA 23 52 

Asn lie Leu Gin Tyr Ala Ser Thr Arg Pro Pro Thr Leu Ser Pro lie 
770 775 780 

CCT CAC ATT CCT CGA AGC CCT TAC AAG TTT CCT AGT TCA CCC TTA CGG 2400 
Pro His lie Pro Arg Ser Pro Tyr Lys Phe Pro Ser Ser Pro Leu Arg 
785 790 795 

ATT CCT GGA GGG AAC ATC TAT ATT TCA CCC CTG AAG AGT CCA TAT AAA 244 8 

lie Pro Gly Gly Asn lie Tyr lie Ser Pro Leu Lys Ser Pro Tyr Lys 
800 805 810 

ATT TCA GAA GGT CTG CCA ACA CCA ACA AAA ATG ACT CCA AGA TCA AGA 2496 
lie Ser Glu Gly Leu Pro Thr Pro Thr Lys Met Thr Pro Arg Ser Arg 
815 820 825 830 

ATC TTA GTA TCA ATT GGT GAA TCA TTC GGG ACT TCT GAG AAG TTC CAG 2544 
lie Leu Val Ser lie Gly Glu Ser Phe Gly Thr Ser Glu Lys Phe Gin 
835 840 845 

AAA ATA AAT CAG ATG GTA TGT AAC AGC GAC CGT GTG CTC AAA AGA AGT 2592 
Lys lie Asn Gin Met Val Cys Asn Ser Asp Arg Val Leu Lys Arg Ser 
850 855 860 

GCT GAA GGA AGC AAC CCT CCT AAA CCA CTG AAA AAA CTA CGC TTT GAT 264 0 

Ala Glu Gly Ser Asn Pro Pro Lys Pro Leu Lys Lys Leu Arg Phe Asp 
865 870 875 

ATT GAA GGA TCA GAT GAA GCA GAT GGA AGT AAA CAT CTC CCA GGA GAG 2688 
lie Glu Gly Ser Asp Glu Ala Asp Gly Ser Lys His Leu Pro Gly Glu 
880 885 890 

TCC AAA TTT CAG CAG AAA CTG GCA GAA ATG ACT TCT ACT CGA ACA CGA 2736 
Ser Lys Phe Gin Gin Lys Leu Ala Glu Met Thr Ser Thr Arg Thr Arg 
895 900 905 910 

ATG CAA AAG CAG AAA ATG AAT GAT AGC ATG GAT ACC TCA AAC AAG GAA 2 784 

Met Gin Lys Gin Lys Met Asn Asp Ser Met Asp Thr Ser Asn Lys Glu 
915 920 925 

GAG AAA TGAGGATCTC AGGACCTTGG TGGACACTGT GTACACCTCT GGATTCATTG 2 84 0 

Glu Lys 



TCTCTCACAG ATGTGACTGT ATAACTTTCC CAGGTTCTGT TTATGGCCAC ATTTAATATC 2 900 

TTCAGCTCTT TTTGTGGATA TAAAATGTGC AGATGCAATT GTTTGGGTGA TTCCTAAGCC 2 960 

ACTTGAAATG TTAGTCATTG TTATTTATAC AAGATTGAAA ATCTTGTGTA AATCCTGCCA 3 020 

TTTAAAAAGT TGTAGCAGAT TGTTTCCTCT TCCAAAGTAA AATTGCTGTG CTTTATGGAT 3080 

AGTAAGAATG GCCCTAGAGT GGGAGTCCTG ATAACCCAGG CCTGTCTGAC TACTTTGCCT 314 0 



-243- 



TCTTTTGTAG CATATAGGTG ATGTTTGCTC TTGTTTTTAT TAATTTATAT GTATATTTTT 32 00 

TTAATTTAAC ATGAACACCC TTAGAAAATG TGTCCTATCT ATCTTCCAAA TGCAATTTGA 3260 

TTGACTGCCC ATTCACCAAA ATTATCCTGA ACTCTTCTGC AAAAATGGAT ATTATTAGAA 332 0 

ATTAGAAAAA AATTACTAAT TTTACACATT AGATTTTATT TTACTATTGG AATCTGATAT 3 380 

ACTGTGTGCT TGTTTTATAA AATTTTGCTT TTAATTAAAT AAAAGCTGGA AGCAAAGTAT 344 0 

AAC CATATGA TACTATCATA CTACTGAAAC AGATTTCATA CCTCAGAATG TAAAAGAACT 3500 

TACTGATTAT TTTCTTCATC CAACTTATGT TTTTAAATGA GGATTATTGA TAGT 3554 

(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 92 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

Met Pro Pro Lys Thr Pro Arg Lys Thr Ala Ala Thr Ala Ala Ala Ala 
15 10 15 

Ala Ala Glu Pro Pro Ala Pro Pro Pro Pro Pro Pro Pro Glu Glu Asp 
20 25 30 

Pro Glu Gin Asp Ser Gly Pro Glu Asp Leu Pro Leu Val Arg Leu Glu 
35 40 . 45 

Phe Glu Glu Thr Glu Glu Pro Asp Phe Thr Ala Leu Cys Gin Lys Leu 
50 55 60 

Lys lie Pro Asp His Val Arg Glu Arg Ala Trp Leu Thr Trp Glu Lys 
65 70 75 80 

Val Ser Ser Val Asp Gly Val Leu Gly Gly Tyr lie Gin Lys Lys Lys 
85 90 95 

Glu Leu Trp Gly He Cys He Phe He Ala Ala Val Asp Leu Gly Asp 
100 105 110 

Met Ser Phe Thr Phe Thr Glu Leu Gin Lys Asn He Glu He Ser Val 
115 120 125 

His Lys Phe Phe Asn Leu Leu Lys Glu He Asp Thr Ser Thr Lys Val 
130 135 140 
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Asp Asn Ala Met Ser Arg Leu Leu Lys Lys Tyr Asp Val Leu Phe Ala 
145 150 155 160 



Leu Phe Ser Lys Leu Glu Arg Thr Cys Glu Leu lie Tyr Leu Thr Gin 
165 170 175 

Pro Ser Ser Ser lie Ser Thr Glu lie Asn Ser Ala Leu Val Leu Lys 
180 185 190 

Val Ser Trp lie Thr Phe Leu Leu Ala Lys Gly Glu Val Leu Gin Met 
195 200 205 

Glu Asp Asp Leu Val lie Ser Phe Gin Leu Met Leu Cys Val Leu Asp 
210 215 220 

Tyr Phe lie Lys Leu Ser Pro Pro Met Leu Leu Lys Glu Pro Tyr Lys 
225 230 235 240 

Thr Ala Val lie Pro lie Asn Gly Ser Pro Arg Thr Pro Arg Arg Gly 
245 250 255 

Gin Asn Arg Ser Ala Arg lie Ala Lys Gin Leu Glu Asn Asp Thr Arg 
260 265 270 

lie lie Glu Val Leu Cys Lys Glu His Glu Cys Asn lie Asp Glu Val 
275 280 285 

Lys Asn Val Tyr Phe Lys Asn Phe lie Pro Phe Met Asn Ser Leu Gly 
290 295 300 

Leu Val Thr Ser Asn Gly Leu Pro Glu Val Glu Asn Leu Ser Lys Arg 
305 310 315 320 

Tyr Glu Glu lie Tyr Leu Lys Asn Lys Asp Leu Asp Ala Arg Leu Phe 
325 330 335 

Leu Asp His Asp Lys Thr Leu Gin Thr Asp Ser lie Asp Ser Phe Glu 
340 345 350 

Thr Gin Arg Thr Pro Arg Lys Ser Asn Leu Asp Glu Glu Val Asn Val 
355 360 365 

lie Pro Pro His Thr Pro Val Arg Thr Val Met Asn Thr lie Gin Gin 
370 375 380 

Leu Met Met lie Leu Asn Ser Ala Ser Asp Gin Pro Ser Glu Asn Leu 
385 390 395 400 

lie Ser Tyr Phe Asn Asn Cys Thr Val Asn Pro Lys Glu Ser lie Leu 
405 410 415 

Lys Arg Val Lys Asp He Gly Tyr He Phe Lys Glu Lys Phe Ala Lys 
420 425 430 
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Ala Val Gly Gin Gly Cys Val Glu lie Gly Ser Gin Arg Tyr Lys Leu 
435 440 445 



Gly Val Arg Leu Tyr Tyr Arg Val Met Glu Ser Met Leu Lys Ser Glu 
450 455 460 

Glu Glu Arg Leu Ser lie Gin Asn Phe Ser Lys Leu Leu Asn Asp Asn 
465 470 475 480 

lie Phe His Met Ser Leu Leu Ala Cys Ala Leu Glu Val Val Met Ala 
485 490 495 

Thr Tyr Ser Arg Ser Thr Ser Gin Asn Leu Asp Ser Gly Thr Asp Leu 
500 505 510 

Ser Phe Pro Trp lie Leu Asn Val Leu Asn Leu Lys Ala Phe Asp Phe 
515 520 525 

Tyr Lys Val lie Glu Ser Phe lie Lys Ala Glu Gly Asn Leu Thr Arg 
530 535 540 

Glu Met lie Lys His Leu Glu Arg Cys Glu His Arg lie Met Glu Ser 
545 550 555 560 

Leu Ala Trp Leu Ser Asp Ser Pro Leu Phe Asp Leu lie Lys Gin Ser 
565 570 575 

Lys Asp Arg Glu Gly Pro Thr Asp His Leu Glu Ser Ala Cys Pro Leu 
580 585 590 

Asn Leu Pro Leu Gin Asn Asn His Thr Ala Ala Asp Met Tyr Leu Ser 
595 600 605 

Pro Val Arg Ser Pro Lys Lys Lys Gly Ser Thr Thr Arg Val Asn Ser 
610 615 620 

Thr Ala Asn Ala Glu Thr Gin Ala Thr Ser Ala Phe Gin Thr Gin Lys 
625 630 635 640 

Pro Leu Lys Ser Thr Ser Leu Ser Leu Phe Tyr Lys Lys Val Tyr Arg 
645 650 655 

Leu Ala Tyr Leu Arg Leu Asn Thr Leu Cys Glu Arg Leu Leu Ser Glu 
660 665 670 

His Pro Glu Leu Glu His lie lie Trp Thr Leu Phe Gin His Thr Leu 
675 680 685 

Gin Asn Glu Tyr Glu Leu Met Arg Asp Arg His Leu Asp Gin lie Met 
690 695 700 

Met Cys Ser Met Tyr Gly lie Cys Lys Val Lys Asn lie Asp Leu Lys 
705 710 715 720 
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Phe Lys lie lie Val Thr Ala Tyr Lys Asp Leu Pro His Ala Val Gin 
725 730 735 



Glu Thr Phe Lys Arg Val Leu lie Lys Glu Glu Glu Tyr Asp Ser lie 
740 745 750 

lie Val Phe Tyr Asn Ser Val Phe Met Gin Arg Leu Lys Thr Asn lie 
755 760 765 

Leu Gin Tyr Ala Ser Thr Arg Pro Pro Thr Leu Ser Pro lie Pro His 
770 775 780 

lie Pro Arg Ser Pro Tyr Lys Phe Pro Ser Ser Pro Leu Arg He Pro 
785 790 795 800 

Gly Gly Asn He Tyr He Ser Pro Leu Lys Ser Pro Tyr Lys He Ser 
805 810 815 

Glu Gly Leu Pro Thr Pro Thr Lys Met Thr Pro Arg Ser Arg He Leu 
820 825 830 

Val Ser He Gly Glu Ser Phe Gly Thr Ser Glu Lys Phe Gin Lys He 
835 840 845 

Asn Gin Met Val Cys Asn Ser Asp Arg Val Leu Lys Arg Ser Ala Glu 
850 855 860 

Gly Ser Asn Pro Pro Lys Pro Leu Lys Lys Leu Arg Phe Asp He Glu 
865 870 875 880 

Gly Ser Asp Glu Ala Asp Gly Ser Lys His Leu Pro Gly Glu Ser Lys 
885 890 895 

Phe Gin Gin Lys Leu Ala Glu Met Thr Ser Thr Arg Thr Arg Met Gin 
900 905 910 

Lys Gin Lys Met Asn Asp Ser Met Asp Thr Ser Asn Lys Glu Glu Lys 
915 920 925 
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WHAT IS CLAIMED IS: 



1 . A DNA segment comprising an isolated gene encoding a modified retinoblastoma tumor 
suppressor protein other than pRB 94 , said modified retinoblastoma tumor suppressor protein 

5 comprising an N-terminal modification. 

2. The DNA segment of claim 1 ? wherein said gene encodes a modified retinoblastoma 
tumor suppressor protein comprising an N-terminal region that comprises a first sequence region 

10 from which at least one amino acid has been deleted. 

yi 3. The DNA segment of claim 2, wherein at least two amino acids have been deleted from 
51 said first sequence region. 

15 Oj 

4. The DNA segment of claim 3, wherein at least about 25 amino acids have been deleted 
^ from said first sequence region. 

20 '% 

5. The DNA segment of claim 4, wherein at least about 100 amino acids have been deleted 
from said first sequence region. 

25 6. The DNA segment of claim 5, wherein at least about 150 amino acids have been deleted 
from said first sequence region. 

7. The DNA segment of claim 6, wherein at least about 300 amino acids have been deleted 
30 from said first sequence region. 
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The DNA segment of claim 2, wherein said first sequence region is 

a) between about amino acid 1 and about amino acid 50; 

b) between about amino acid 5 1 and about amino acid 100; 

c) between about amino acid 101 and about amino acid 150; 

d) between about amino acid 1 5 1 and about amino acid 200; 

e) between about amino acid 201 and about amino acid 250; 

f) between about amino acid 251 and about amino acid 300; 

g) between about amino acid 1 and about amino acid 100; 

h) between about amino acid 5 1 and about amino acid 1 50; 

i) between about amino acid 101 and about amino acid 200; 
j) between about amino acid 151 and about amino acid 250; 
k) between about amino acid 201 and about amino acid 300; 
1) between about amino acid 1 and about amino acid 150; 
m) between about amino acid 5 1 and about amino acid 200; 
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n) between about amino acid 101 and about amino acid 250; 

o) between about amino acid 151 and about amino acid 300; 

5 p) between about amino acid 1 and about amino acid 200; 

q) between about amino acid 5 1 and about amino acid 250; 

r) between about amino acid 101 and about amino acid 300; 

10 

s) between about amino acid 1 and about amino acid 250; 

yi t) between about amino acid 5 1 and about amino acid 300; or 

15 «j; u) between about amino acid 1 and about amino acid 300. 

J ! 9, The DNA segment of claim 2 9 wherein: 

20 yi a) about amino acid 2 through about amino acid 34 have been deleted from said first 

sequence region; 

b) about amino acid 2 through about amino acid 55 have been deleted from said first 
sequence region; 

25 

c) about amino acid 2 through about amino acid 78 have been deleted from said first 
sequence region; 

d) about amino acid 2 through about amino acid 97 have been deleted from said first 
30 sequence region; 
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e) about amino acid 2 through about amino acid 148 have been deleted from said 
first sequence region; 

f) about amino acid 3 1 through about amino acid 1 07 have been deleted from said 
first sequence region; 

g) about amino acid 77 through about amino acid 107 have been deleted from said 
first sequence region; 

h) about amino acid 1 1 1 through about amino acid 181 have been deleted from said 
first sequence region; 

i) about amino acid 111 through about amino acid 241 have been deleted from said 
first sequence region; 

j) about amino acid 181 through about amino acid 241 have been deleted from said 
first sequence region; or 

k) about amino acid 242 through about amino acid 300 have been deleted from said 
first sequence region. 



10. The DNA segment of claim 2, wherein said N-terminal region of said modified 
retinoblastoma tumor suppressor protein further comprises a second sequence region from which 
at least one amino acid has been deleted. 



1 1 . The DNA segment of claim 10, wherein about amino acid 2 through about amino acid 34, 
and about amino acid 76 through about amino acid 112 have been deleted. 
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12. The DNA segment of claim 10, wherein about amino acid 2 through about amino acid 55, 
and about amino acid 76 through about amino acid 1 12 have been deleted. 

13. The DNA segment of claim 1, wherein said gene encodes a modified retinoblastoma 
tumor suppressor protein comprising at least a first N-terminal mutation, and wherein said 
modified retinoblastoma tumor suppressor protein has an increased biological activity in 
comparison to the biological activity of the corresponding wild-type retinoblastoma tumor 
suppressor protein. 

14. The DNA segment of claim 13, wherein said gene encodes a modified retinoblastoma 
tumor suppressor protein comprising a mutation at position 111. 

15. The DNA segment of claim 14, wherein said modified retinoblastoma tumor suppressor 
protein comprises glycine at position 1 1 1 in place of aspartic acid. 

16. The DNA segment of claim 13, wherein said modified retinoblastoma tumor suppressor 
protein comprises at least a second N-terminal mutation. 

17. The DNA segment of claim 16, wherein said gene encodes a modified retinoblastoma 
tumor suppressor protein comprising a mutation at position 1 1 1 and a mutation at position 112. 
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18. The DNA segment of claim 17, wherein said modified retinoblastoma tumor suppressor 
protein comprises glycine at position 1 1 1 in place of aspartic acid, and aspartic acid at position 
1 12 in place of glutamic acid. 

5 

19. The DNA segment of claim 1, wherein said gene encodes a modified retinoblastoma 
tumor suppressor protein comprising an N-terminal region from which at least one amino acid 
has been deleted, and which contains at least one amino acid mutation. 

10 

20. The DNA segment of claim 2, wherein said gene encodes a modified retinoblastoma 
tumor suppressor protein that comprises at least the C-terminal amino acid sequence from about 

]i position 370 to about position 928 of SEQ ID NO:2. 

15 0] 

yl 21. The DNA segment of claim 2, wherein said gene encodes a modified retinoblastoma 
" tumor suppressor protein comprising the contiguous amino acid sequence of SEQ ID NO:29; 
O SEQ ID NO:31; SEQ ID NO:33; SEQ ID NO:35; SEQ ID NO:37; SEQ ID NO:39; SEQ ID 
M NO:41; SEQ ID NO:43; SEQ ID NO:45; SEQ ID NO:47; SEQ ID NO:49; or SEQ ID NO:51. 

20 % 

22. The DNA segment of claim 2, wherein said gene comprises the contiguous nucleic acid 
sequence from between position 7 and position 2691 of SEQ ID NO:28; from between position 7 
and position 2628 of SEQ ID NO:30; from between position 7 and position 2559 of SEQ ID 
25 NO:32; from between position 7 and position 2502 of SEQ ID NO:34; from between position 7 
and position 2349 of SEQ ID NO:36; from between position 7 and position 2559 of SEQ ID 
NO:38; from between position 7 and position 2697 of SEQ ID NO:40; from between position 7 
and position 2583 of SEQ ID NO:42; from between position 7 and position 2397 of SEQ ID 
NO:44; from between position 7 and position 2613 of SEQ ID NO:46; from between position 7 
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and position 2619 of SEQ ID NO:48; or from between position 7 and position 2790 of SEQ ID 
NO:50. 

23. The DNA segment of claim 1 , operationally positioned under the control of a promoter. 

24. The DNA segment of claim 23, further defined as a recombinant vector. 

25. The DNA segment of claim 24, wherein said recombinant vector is comprised within an 
adenoviral vector. 

26. The DNA segment of claim 25, wherein said adenoviral vector is comprised within a 
recombinant adenovirus. 

27. The DNA segment of claim 1 , comprised within a host cell. 

28. The DNA segment of claim 27, wherein said host cell is a eukaryotic cell. 

29. The DNA segment of claim 28, wherein said host cell is a human cell. 

30. The DNA segment of claim 28, wherein said host cell is a tumor cell 
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31. The DNA segment of claim 28, wherein said host cell is comprised within an animal. 



32. The DNA segment of claim 3 1 , wherein said animal is a human subject. 

33. The DNA segment of claim 1, dispersed in a pharmaceutically acceptable excipient. 

34. The DNA segment of claim 1, wherein said modified retinoblastoma tumor suppressor 
protein is characterized as: 

a) comprising an N-terminal region that comprises at least a first sequence region 
from which at least one amino acid has been deleted, and wherein said modified 
retinoblastoma tumor suppressor protein has a biological activity at least about 
equivalent to the biological activity of the corresponding wild-type retinoblastoma 
tumor suppressor protein; or 

b) comprising an N-terminal region that comprises a first sequence region 
comprising at least one mutation, and wherein said modified retinoblastoma tumor 
suppressor protein has an increased biological activity in comparison to the 
biological activity of the corresponding wild-type retinoblastoma tumor 
suppressor protein. 

35. A modified retinoblastoma tumor suppressor protein other than pRB 94 , said modified 
retinoblastoma tumor suppressor protein comprising an N-terminal modification, wherein said 
modified retinoblastoma tumor suppressor protein has a biological activity at least about 
equivalent to the biological activity of the corresponding wild-type retinoblastoma tumor 
suppressor protein. 
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36. A recombinant host cell comprising a DNA segment comprising an isolated gene 
encoding a modified retinoblastoma tumor suppressor protein other than pRB 94 , said modified 

5 retinoblastoma tumor suppressor protein comprising an N-terminal modification. 

37. The recombinant host cell of claim 36, wherein said host cell is a tumor cell. 

10 

38. A method of inhibiting cellular proliferation, comprising contacting a cell with an 
effective inhibitory amount of a first modified retinoblastoma tumor suppressor protein other 

~g than pRB 94 , said modified retinoblastoma tumor suppressor protein comprising an N-terminal 
Jrf modification. 

15 CP 

^~ 39. The method of claim 38, wherein said cell is contacted with said first modified 
retinoblastoma tumor suppressor protein by providing to said cell a DNA segment that expresses 
M said first modified retinoblastoma tumor suppressor protein in said cell. 

20 J{ 

40. The method of claim 38, wherein said cell is located within an animal and said first 
modified retinoblastoma tumor suppressor protein, or a gene encoding said modified 
retinoblastoma tumor suppressor protein, is administered to said animal in a pharmaceutically 

25 acceptable vehicle. 

41. The method of claim 38, wherein said cell is contacted with a modified retinoblastoma 
tumor suppressor protein and a p53 tumor suppressor protein in a combined amount effective to 

30 inhibit cellular proliferation in said cell 
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42. A method of inhibiting cellular proliferation, comprising contacting a cell with a 
retinoblastoma protein and a p53 protein in a combined amount effective to inhibit cellular 
proliferation in said cell. 

43. A method of treating cancer, comprising administering to an animal with cancer a 
pharmaceutically acceptable composition comprising a biologically effective inhibitory amount 
of a first modified retinoblastoma tumor suppressor protein, other than pRB 94 , that comprises an 
N-terminal modification. 
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ABSTRACT 



Disclosed are modified broad-spectrum retinoblastoma tumor suppressor proteins that 
have at least the same, and in most cases higher biological activity than the corresponding 
wild-type retinoblastoma tumor suppressor protein. Exemplary modified retinoblastoma tumor 
suppressor proteins have a modified N-terminal region, in particular comprising one or more 
deletions and/or mutations. Also disclosed are methods of making and using the modified 
retinoblastoma tumor suppressor proteins, particularly in circumstances where inhibition of cell 
growth is desired. Thus the present disclosure provides methods for treating diseases, as 
exemplified by, but not limited to cancer, that are characterized by abnormal cellular 
proliferation. 
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SEQUENCE LISTING 



GENERAL INFORMATION: 

(i) APPLICANT: Xu, Hong-Ji 
Hu, Shi-Xue 
Benedict, William F. 
Zhou, Yunli 

(ii) TITLE OF INVENTION: MODIFIED RETINOBLASTOMA TUMOR SUPPRESSOR 
PROTEINS 

(iii) NUMBER OF SEQUENCES: 51 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Arnold, White & Durkee 

(B) STREET: P.O. Box 4433 

(C) CITY: Houston 

(D) STATE: TX 

(E) COUNTRY: USA 

(F) ZIP: 77210-4433 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US UNKNOWN 

(B) FILING DATE: 

(C) CLASSIFICATION: UNKNOWN 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/038,118 

(B) FILING DATE: 20-FEB-1997 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Hibler, David W. 

(B) REGISTRATION NUMBER: 41,071 

(C) REFERENCE /DOCKET NUMBER: UTXC:506 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 512/418-3000 

(B) TELEFAX: 512/474-7577 



(2) INFORMATION FOR SEQ ID NO : 1 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 555 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 7.. 2 790 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

CGCGTC ATG CCG CCC AAA ACC CCC CGA AAA ACG GCC GCC ACC GCC GCC 48 
Met Pro Pro Lys Thr Pro Arg Lys Thr Ala Ala Thr Ala Ala 
15 10 

GCT GCC GCC GCG GAA CCC CCG GCA CCG CCG CCG CCG CCC CCT CCT GAG 96 
Ala Ala Ala Ala Glu Pro Pro Ala Pro Pro Pro Pro Pro Pro Pro Glu 
15 20 25 30 

GAG GAC CCA GAG CAG GAC AGC GGC CCG GAG GAC CTG CCT CTC GTC AGG 144 
Glu Asp Pro Glu Gin Asp Ser Gly Pro Glu Asp Leu Pro Leu Val Arg 
35 40 45 

CTT GAG TTT GAA GAA ACA GAA GAA CCT GAT TTT ACT GCA TTA TGT CAG 192 
Leu Glu Phe Glu Glu Thr Glu Glu Pro Asp Phe Thr Ala Leu Cys Gin 
50 55 60 

AAA TTA AAG ATA CCA GAT CAT GTC AGA GAG AGA GCT TGG TTA ACT TGG 240 
Lys Leu Lys lie Pro Asp His Val Arg Glu Arg Ala Trp Leu Thr Trp 
65 70 75 

GAG AAA GTT TCA TCT GTG GAT GGA GTA TTG GGA GGT TAT ATT CAA AAG 288 
Glu Lys Val Ser Ser Val Asp Gly Val Leu Gly Gly Tyr lie Gin Lys 
80 85 90 

AAA AAG GAA CTG TGG GGA ATC TGT ATC TTT ATT GCA GCA GTT GAC CTA 336 
Lys Lys Glu Leu Trp Gly lie Cys He Phe He Ala Ala Val Asp Leu 
95 100 105 110 

GAT GAG ATG TCG TTC ACT TTT ACT GAG CTA CAG AAA AAC ATA GAA ATC 384 
Asp Glu Met Ser Phe Thr Phe Thr Glu Leu Gin Lys Asn He Glu He 
115 120 125 

AGT GTC CAT AAA TTC TTT AAC TTA CTA AAA GAA ATT GAT ACC AGT ACC 432 
Ser Val His Lys Phe Phe Asn Leu Leu Lys Glu He Asp Thr Ser Thr 
130 135 140 

AAA GTT GAT AAT GCT ATG TCA AGA CTG TTG AAG AAG TAT GAT GTA TTG 48 0 

Lys Val Asp Asn Ala Met Ser Arg Leu Leu Lys Lys Tyr Asp Val Leu 
145 150 155 



TTT GCA CTC TTC AGC AAA TTG GAA AGG ACA TGT GAA CTT ATA TAT TTG 528 
Phe Ala Leu Phe Ser Lys Leu Glu Arg Thr Cys Glu Leu He Tyr Leu 
160 165 170 

ACA CAA CCC AGC AGT TCG ATA TCT ACT GAA ATA AAT TCT GCA TTG GTG 576 
Thr Gin Pro Ser Ser Ser He Ser Thr Glu He Asn Ser Ala Leu Val 
175 180 185 190 

CTA AAA GTT TCT TGG ATC ACA TTT TTA TTA GCT AAA GGG GAA GTA TTA 624 
Leu Lys Val Ser Trp He Thr Phe Leu Leu Ala Lys Gly Glu Val Leu 
195 200 205 

CAA ATG GAA GAT GAT CTG GTG ATT TCA TTT CAG TTA ATG CTA TGT GTC 672 
Gin Met Glu Asp Asp Leu Val He Ser Phe Gin Leu Met Leu Cys Val 
210 215 220 

CTT GAC TAT TTT ATT AAA CTC TCA CCT CCC ATG TTG CTC AAA GAA CCA 720 
Leu Asp Tyr Phe He Lys Leu Ser Pro Pro Met Leu Leu Lys Glu Pro 
225 230 235 

TAT AAA ACA GCT GTT ATA CCC ATT AAT GGT TCA CCT CGA ACA CCC AGG 768 
Tyr Lys Thr Ala Val He Pro He Asn Gly Ser Pro Arg Thr Pro Arg 
240 245 250 

CGA GGT CAG AAC AGG AGT GCA CGG ATA GCA AAA CAA CTA GAA AAT GAT 816 
Arg Gly Gin Asn Arg Ser Ala Arg He Ala Lys Gin Leu Glu Asn Asp 
255 260 265 270 

ACA AGA ATT ATT GAA GTT CTC TGT AAA GAA CAT GAA TGT AAT ATA GAT 864 
Thr Arg He He Glu Val Leu Cys Lys Glu His Glu Cys Asn He Asp 
275 280 285 

GAG GTG AAA AAT GTT TAT TTC AAA AAT TTT ATA CCT TTT ATG AAT TCT 912 
Glu Val Lys Asn Val Tyr Phe Lys Asn Phe He Pro Phe Met Asn Ser 
290 295 300 

CTT GGA CTT GTA ACA TCT AAT GGA CTT CCA GAG GTT GAA AAT CTT TCT 96 0 

Leu Gly Leu Val Thr Ser Asn Gly Leu Pro Glu Val Glu Asn Leu Ser 
305 310 315 

AAA CGA TAC GAA GAA ATT TAT CTT AAA AAT AAA GAT CTA GAT GCA AGA 1008 
Lys Arg Tyr Glu Glu He Tyr Leu Lys Asn Lys Asp Leu Asp Ala Arg 
320 325 330 

TTA TTT TTG GAT CAT GAT AAA ACT CTT CAG ACT GAT TCT ATA GAC AGT 1056 
Leu Phe Leu Asp His Asp Lys Thr Leu Gin Thr Asp Ser He Asp Ser 
335 340 345 350 

TTT GAA ACA CAG AGA ACA CCA CGA AAA AGT AAC CTT GAT GAA GAG GTG 1104 
Phe Glu Thr Gin Arg Thr Pro Arg Lys Ser Asn Leu Asp Glu Glu Val 
355 360 365 



AAT GTA ATT CCT CCA CAC ACT CCA GTT AGG ACT GTT ATG AAC ACT ATC 1152 
Asn Val He Pro Pro His Thr Pro Val Arg Thr Val Met Asn Thr He 
370 375 380 

CAA CAA TTA ATG ATG ATT TTA AAT TCA GCA AGT GAT CAA CCT TCA GAA 1200 
Gin Gin Leu Met Met He Leu Asn Ser Ala Ser Asp Gin Pro Ser Glu 
385 390 395 

AAT CTG ATT TCC TAT TTT AAC AAC TGC ACA GTG AAT CCA AAA GAA AGT 1248 
Asn Leu He Ser Tyr Phe Asn Asn Cys Thr Val Asn Pro Lys Glu Ser 
400 405 410 

ATA CTG AAA AGA GTG AAG GAT ATA GGA TAC ATC TTT AAA GAG AAA TTT 12 96 

He Leu Lys Arg Val Lys Asp He Gly Tyr He Phe Lys Glu Lys Phe 
415 420 425 430 

GCT AAA GCT GTG GGA CAG GGT TGT GTC GAA ATT GGA TCA CAG CGA TAC 1344 
Ala Lys Ala Val Gly Gin Gly Cys Val Glu He Gly Ser Gin Arg Tyr 
435 440 445 

AAA CTT GGA GTT CGC TTG TAT TAC CGA GTA ATG GAA TCC ATG CTT AAA 13 92 

Lys Leu Gly Val Arg Leu Tyr Tyr Arg Val Met Glu Ser Met Leu Lys 
450 455 460 

TCA GAA GAA GAA CGA TTA TCC ATT CAA AAT TTT AGC AAA CTT CTG AAT 1440 
Ser Glu Glu Glu Arg Leu Ser He Gin Asn Phe Ser Lys Leu Leu Asn 
465 470 475 

GAC AAC ATT TTT CAT ATG TCT TTA TTG GCG TGC GCT CTT GAG GTT GTA 1488 
Asp Asn He Phe His Met Ser Leu Leu Ala Cys Ala Leu Glu Val Val 
480 485 490 

ATG GCC ACA TAT AGC AGA AGT ACA TCT CAG AAT CTT GAT TCT GGA ACA 1536 
Met Ala Thr Tyr Ser Arg Ser Thr Ser Gin Asn Leu Asp Ser Gly Thr 
495 500 505 510 

GAT TTG TCT TTC CCA TGG ATT CTG AAT GTG CTT AAT TTA AAA GCC TTT 1584 
Asp Leu Ser Phe Pro Trp He Leu Asn Val Leu Asn Leu Lys Ala Phe 
515 520 525 

GAT TTT TAC AAA GTG ATC GAA AGT TTT ATC AAA GCA GAA GGC AAC TTG 1632 
Asp Phe Tyr Lys Val He Glu Ser Phe He Lys Ala Glu Gly Asn Leu 
530 535 540 

ACA AGA GAA ATG ATA AAA CAT TTA GAA CGA TGT GAA CAT CGA ATC ATG 1680 
Thr Arg Glu Met He Lys His Leu Glu Arg Cys Glu His Arg He Met 
545 550 555 

GAA TCC CTT GCA TGG CTC TCA GAT TCA CCT TTA TTT GAT CTT ATT AAA 1728 
Glu Ser Leu Ala Trp Leu Ser Asp Ser Pro Leu Phe Asp Leu He Lys 
560 565 570 



CAA TCA AAG GAC CGA GAA GGA CCA ACT GAT CAC CTT GAA TCT GCT TGT 1776 
Gin Ser Lys Asp Arg Glu Gly Pro Thr Asp His Leu Glu Ser Ala Cys 
575 580 585 590 



CCT CTT AAT CTT CCT CTC CAG AAT AAT CAC ACT GCA GCA GAT ATG TAT 
Pro Leu Asn Leu Pro Leu Gin Asn Asn His Thr Ala Ala Asp Met Tyr 
595 600 605 

CTT TCT CCT GTA AGA TCT CCA AAG AAA AAA GGT TCA ACT ACG CGT GTA 
Leu Ser Pro Val Arg Ser Pro Lys Lys Lys Gly Ser Thr Thr Arg Val 
610 615 620 

AAT TCT ACT GCA AAT GCA GAG ACA CAA GCA ACC TCA GCC TTC CAG ACC 
Asn Ser Thr Ala Asn Ala Glu Thr Gin Ala Thr Ser Ala Phe Gin Thr 
625 630 635 

CAG AAG CCA TTG AAA TCT ACC TCT CTT TCA CTG TTT TAT AAA AAA GTG 
Gin Lys Pro Leu Lys Ser Thr Ser Leu Ser Leu Phe Tyr Lys Lys Val 
540 645 650 

TAT CGG CTA GCC TAT CTC CGG CTA AAT ACA CTT TGT GAA CGC CTT CTG 
Tyr Arg Leu Ala Tyr Leu Arg Leu Asn Thr Leu Cys Glu Arg Leu Leu 
655 660 665 670 

TCT GAG CAC CCA GAA TTA GAA CAT ATC ATC TGG ACC CTT TTC CAG CAC 
Ser Glu His Pro Glu Leu Glu His He He Trp Thr Leu Phe Gin His 
675 680 685 

ACC CTG CAG AAT GAG TAT GAA CTC ATG AGA GAC AGG CAT TTG GAC CAA 
Thr Leu Gin Asn Glu Tyr Glu Leu Met Arg Asp Arg His Leu Asp Gin 
690 695 700 

ATT ATG ATG TGT TCC ATG TAT GGC ATA TGC AAA GTG AAG AAT ATA GAC 
He Met Met Cys Ser Met Tyr Gly He Cys Lys Val Lys Asn He Asp 
705 710 715 

CTT AAA TTC AAA ATC ATT GTA ACA GCA TAC AAG GAT CTT CCT CAT GCT 
Leu Lys Phe Lys He He Val Thr Ala Tyr Lys Asp Leu Pro His Ala 
720 725 730 

GTT CAG GAG ACA TTC AAA CGT GTT TTG ATC AAA GAA GAG GAG TAT GAT 
Val Gin Glu Thr Phe Lys Arg Val Leu He Lys Glu Glu Glu Tyr Asp 
735 740 745 750 

TCT ATT ATA GTA TTC TAT AAC TCG GTC TTC ATG CAG AGA CTG AAA ACA 
Ser He He Val Phe Tyr Asn Ser Val Phe Met Gin Arg Leu Lys Thr 
755 760 765 

AAT ATT TTG CAG TAT GCT TCC ACC AGG CCC CCT ACC TTG TCA CCA ATA 
Asn He Leu Gin Tyr Ala Ser Thr Arg Pro Pro Thr Leu Ser Pro He 
770 775 780 



1824 



1872 



1920 



1968 



2016 



2064 



2112 



2160 



2208 



2256 



2304 



2352 



CCT CAC ATT CCT CGA AGC CCT TAC AAG TTT CCT AGT TCA CCC TTA CGG 
Pro His He Pro Arg Ser Pro Tyr Lys Phe Pro Ser Ser Pro Leu Arg 
785 790 795 



2400 



ATT CCT GGA GGG AAC ATC TAT ATT TCA CCC CTG AAG AGT CCA TAT AAA 2448 
He Pro Gly Gly Asn He Tyr He Ser Pro Leu Lys Ser Pro Tyr Lys 
800 805 810 

ATT TCA GAA GGT CTG CCA ACA CCA ACA AAA ATG ACT CCA AGA TCA AGA 2496 
He Ser Glu Gly Leu Pro Thr Pro Thr Lys Met Thr Pro Arg Ser Arg 
815 820 825 830 

ATC TTA GTA TCA ATT GGT GAA TCA TTC GGG ACT TCT GAG AAG TTC CAG 2544 
He Leu Val Ser He Gly Glu Ser Phe Gly Thr Ser Glu Lys Phe Gin 
835 840 845 

AAA ATA AAT CAG ATG GTA TGT AAC AGC GAC CGT GTG CTC AAA AGA AGT 2592 
Lys He Asn Gin Met Val Cys Asn Ser Asp Arg Val Leu Lys Arg Ser 
850 855 860 

GCT GAA GGA AGC AAC CCT CCT AAA CCA CTG AAA AAA CTA CGC TTT GAT 2640 
Ala Glu Gly Ser Asn Pro Pro Lys Pro Leu Lys Lys Leu Arg Phe Asp 
865 870 875 

ATT GAA GGA TCA GAT GAA GCA GAT GGA AGT AAA CAT CTC CCA GGA GAG 2688 
He Glu Gly Ser Asp Glu Ala Asp Gly Ser Lys His Leu Pro Gly Glu 
880 885 890 

TCC AAA TTT CAG CAG AAA CTG GCA GAA ATG ACT TCT ACT CGA ACA CGA 2736 
Ser Lys Phe Gin Gin Lys Leu Ala Glu Met Thr Ser Thr Arg Thr Arg 
895 900 905 910 

ATG CAA AAG CAG AAA ATG AAT GAT AGC ATG GAT ACC TCA AAC AAG GAA 2 784 

Met Gin Lys Gin Lys Met Asn Asp Ser Met Asp Thr Ser Asn Lys Glu 
915 920 925 

GAG AAA TGAGGATCTC AGGACCTTGG TGGACACTGT GTACACCTCT GGATTCATTG 2840 
Glu Lys 



TCTCTCACAG ATGTGACTGT ATAACTTTCC CAGGTTCTGT TTATGGCCAC ATTTAATATC 2900 

TTCAGCTCTT TTTGTGGATA TAAAATGTGC AGATGCAATT GTTTGGGTGA TTCCTAAGCC 2960 

ACTTGAAATG TTAGTCATTG TTATTTATAC AAGATTGAAA ATCTTGTGTA AATCCTGCCA 302 0 

TTTAAAAAGT TGTAGCAGAT TGTTTCCTCT TCCAAAGTAA AATTGCTGTG CTTTATGGAT 3080 

AGTAAGAATG GCCCTAGAGT GGGAGTCCTG ATAACCCAGG CCTGTCTGAC TACTTTGCCT 3140 

TCTTTTGTAG CATATAGGTG ATGTTTGCTC TTGTTTTTAT TAATTTATAT GTATATTTTT 32 00 

TTAATTTAAC ATGAACACCC TTAGAAAATG TGTCCTATCT ATCTTCCAAA TGCAATTTGA 3260 



TTGACTGCCC ATTC AC C AAA ATTATCCTGA ACTCTTCTGC AAAAATGGAT ATTATTAGAA 3320 

ATTAGAAAAA AATTACTAAT TTTACACATT AGATTTTATT TTACTATTGG AATCTGATAT 3380 

ACTGTGTGCT TGTTTTATAA AATTTTGCTT TTAATTAAAT AAAAGCTGGA AGCAAAGTAT 3440 

AACCATATGA TACTATCATA CTACTGAAAC AGATTTCATA CCTCAGAATG TAAAAGAACT 3500 

TACTGATTAT TTTCTTCATC CAACTTATGT TTTTAAATGA GGATTATTGA TAGTC 3555 

(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 928 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Pro Pro Lys Thr Pro Arg Lys Thr Ala Ala Thr Ala Ala Ala Ala 
15 10 15 

Ala Ala Glu Pro Pro Ala Pro Pro Pro Pro Pro Pro Pro Glu Glu Asp 
20 25 30 

Pro Glu Gin Asp Ser Gly Pro Glu Asp Leu Pro Leu Val Arg Leu Glu 
35 40 45 

Phe Glu Glu Thr Glu Glu Pro Asp Phe Thr Ala Leu Cys Gin Lys Leu 
50 55 60 

Lys lie Pro Asp His Val Arg Glu Arg Ala Trp Leu Thr Trp Glu Lys 
65 70 75 80 

Val Ser Ser Val Asp Gly Val Leu Gly Gly Tyr He Gin Lys Lys Lys 
85 90 95 

Glu Leu Trp Gly He Cys He Phe He Ala Ala Val Asp Leu Asp Glu 
100 105 110 

Met Ser Phe Thr Phe Thr Glu Leu Gin Lys Asn He Glu He Ser Val 
115 120 125 

His Lys Phe Phe Asn Leu Leu Lys Glu He Asp Thr Ser Thr Lys Val 
130 135 140 

Asp Asn Ala Met Ser Arg Leu Leu Lys Lys Tyr Asp Val Leu Phe Ala 
145 150 155 160 

Leu Phe Ser Lys Leu Glu Arg Thr Cys Glu Leu He Tyr Leu Thr Gin 
165 170 175 



Pro Ser Ser Ser He Ser Thr Glu He Asn Ser Ala Leu Val Leu Lys 
180 185 190 



Val Ser Trp He Thr Phe Leu Leu Ala Lys Gly Glu Val Leu Gin Met 
!95 200 205 

Glu Asp Asp Leu Val He Ser Phe Gin Leu Met Leu Cys Val Leu Asp 
210 215 220 

Tyr Phe He Lys Leu Ser Pro Pro Met Leu Leu Lys Glu Pro Tyr Lys 
225 230 235 240 

Thr Ala Val He Pro He Asn Gly Ser Pro Arg Thr Pro Arg Arg Gly 
245 250 255 

Gin Asn Arg Ser Ala Arg He Ala Lys Gin Leu Glu Asn Asp Thr Arg 
260 265 270 

He He Glu Val Leu Cys Lys Glu His Glu Cys Asn He Asp Glu Val 
275 280 285 

Lys Asn Val Tyr Phe Lys Asn Phe He Pro Phe Met Asn Ser Leu Gly 
290 295 300 

Leu Val Thr Ser Asn Gly Leu Pro Glu Val Glu Asn Leu Ser Lys Arg 
305 310 315 320 

Tyr Glu Glu He Tyr Leu Lys Asn Lys Asp Leu Asp Ala Arg Leu Phe 
325 330 335 

Leu Asp His Asp Lys Thr Leu Gin Thr Asp Ser He Asp Ser Phe Glu 
340 345 350 

Thr Gin Arg Thr Pro Arg Lys Ser Asn Leu Asp Glu Glu Val Asn Val 
355 360 365 

He Pro Pro His Thr Pro Val Arg Thr Val Met Asn Thr He Gin Gin 
370 375 380 

Leu Met Met He Leu Asn Ser Ala Ser Asp Gin Pro Ser Glu Asn Leu 
385 390 395 400 

He Ser Tyr Phe Asn Asn Cys Thr Val Asn Pro Lys Glu Ser He Leu 
405 410 415 

Lys Arg Val Lys Asp He Gly Tyr He Phe Lys Glu Lys Phe Ala Lys 
420 425 430 

Ala Val Gly Gin Gly Cys Val Glu He Gly Ser Gin Arg Tyr Lys Leu 
435 440 445 

Gly Val Arg Leu Tyr Tyr Arg Val Met Glu Ser Met Leu Lys Ser Glu 
450 455 460 



Glu Glu Arg Leu Ser He Gin Asn Phe Ser Lys Leu Leu Asn Asp Asn 
«5 470 475 480 

He Phe His Met Ser Leu Leu Ala Cys Ala Leu Glu Val Val Met Ala 
485 490 495 

Thr Tyr Ser Arg Ser Thr Ser Gin Asn Leu Asp Ser Gly Thr Asp Leu 
500 505 510 

Ser Phe Pro Trp He Leu Asn Val Leu Asn Leu Lys Ala Phe Asp Phe 
515 520 525 

Tyr Lys Val He Glu Ser Phe He Lys Ala Glu Gly Asn Leu Thr Arg 
530 535 540 

Glu Met He Lys His Leu Glu Arg Cys Glu His Arg He Met Glu Ser 
545 550 555 560 

Leu Ala Trp Leu Ser Asp Ser Pro Leu Phe Asp Leu He Lys Gin Ser 
565 570 575 

Lys Asp Arg Glu Gly Pro Thr Asp His Leu Glu Ser Ala Cys Pro Leu 
580 585 590 

Asn Leu Pro Leu Gin Asn Asn His Thr Ala Ala Asp Met Tyr Leu Ser 
595 600 605 

Pro Val Arg Ser Pro Lys Lys Lys Gly Ser Thr Thr Arg Val Asn Ser 
610 615 620 

Thr Ala Asn Ala Glu Thr Gin Ala Thr Ser Ala Phe Gin Thr Gin Lys 
625 630 635 640 

Pro Leu Lys Ser Thr Ser Leu Ser Leu Phe Tyr Lys Lys Val Tyr Arg 
645 650 655 

Leu Ala Tyr Leu Arg Leu Asn Thr Leu Cys Glu Arg Leu Leu Ser Glu 
660 665 670 

His Pro Glu Leu Glu His He He Trp Thr Leu Phe Gin His Thr Leu 
675 680 685 

Gin Asn Glu Tyr Glu Leu Met Arg Asp Arg His Leu Asp Gin He Met 
690 695 700 

Met Cys Ser Met Tyr Gly He Cys Lys Val Lys Asn He Asp Leu Lys 
705 710 715 720 

Phe Lys He He Val Thr Ala Tyr Lys Asp Leu Pro His Ala Val Gin 
725 730 735 

Glu Thr Phe Lys Arg Val Leu He Lys Glu Glu Glu Tyr Asp Ser He 
740 745 750 



He Val Phe Tyr Asn Ser Val Phe Met Gin Arg Leu Lys Thr Asn He 
755 760 765 



Leu Gin Tyr Ala 
770 

He Pro Arg Ser 
785 

Gly Gly Asn He 



Glu Gly Leu Pro 
820 

Val Ser He Gly 
835 

Asn Gin Met Val 
850 

Gly Ser Asn Pro 
865 

Gly Ser Asp Glu 



Phe Gin Gin Lys 
900 

Lys Gin Lys Met 
915 



Ser Thr Arg Pro 
775 

Pro Tyr Lys Phe 
790 

Tyr He Ser Pro 
805 

Thr Pro Thr Lys 



Glu Ser Phe Gly 
840 

Cys Asn Ser Asp 
855 

Pro Lys Pro Leu 
870 

Ala Asp Gly Ser 
885 

Leu Ala Glu Met 



Asn Asp Ser Met 
920 



Pro Thr Leu Ser 
780 

Pro Ser Ser Pro 
795 

Leu Lys Ser Pro 
810 

Met Thr Pro Arg 
825 

Thr Ser Glu Lys 



Arg Val Leu Lys 
860 

Lys Lys Leu Arg 
875 

Lys His Leu Pro 
890 

Thr Ser Thr Arg 
905 

Asp Thr Ser Asn 



Pro He Pro His 



Leu Arg He Pro 
800 

Tyr Lys He Ser 
815 

Ser Arg He Leu 
830 

Phe Gin Lys He 
845 

Arg Ser Ala Glu 



Phe Asp He Glu 
880 

Gly Glu Ser Lys 
895 

Thr Arg Met Gin 
910 

Lys Glu Glu Lys 
925 



( 2 ) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3218 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 7.. 2454 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

GCCGTC ATG TCG TTC ACT TTT ACT GAG CTA CAG AAA AAC ATA GAA ATC 
Met Ser Phe Thr Phe Thr Glu Leu Gin Lys Asn He Glu He 
1 5 10 



AGT GTC CAT AAA TTC TTT AAC TTA CTA AAA GAA ATT GAT ACC AGT ACC 
Ser Val His Lys Phe Phe Asn Leu Leu Lys Glu lie Asp Thr Ser Thr 
15 20 25 30 



96 



AAA GTT GAT AAT GCT ATG TCA AGA CTG TTG AAG AAG TAT GAT GTA TTG 144 
Lys Val Asp Asn Ala Met Ser Arg Leu Leu Lys Lys Tyr Asp Val Leu 
35 40 45 

TTT GCA CTC TTC AGC AAA TTG GAA AGG ACA TGT GAA CTT ATA TAT TTG 192 
Phe Ala Leu Phe Ser Lys Leu Glu Arg Thr Cys Glu Leu He Tyr Leu 
50 55 60 

ACA CAA CCC AGC AGT TCG ATA TCT ACT GAA ATA AAT TCT GCA TTG GTG 240 
Thr Gin Pro Ser Ser Ser He Ser Thr Glu He Asn Ser Ala Leu Val 
65 70 75 

CTA AAA GTT TCT TGG ATC ACA TTT TTA TTA GCT AAA GGG GAA GTA TTA 288 
Leu Lys Val Ser Trp He Thr Phe Leu Leu Ala Lys Gly Glu Val Leu 
80 85 90 

CAA ATG GAA GAT GAT CTG GTG ATT TCA TTT CAG TTA ATG CTA TGT GTC 336 
Gin Met Glu Asp Asp Leu Val He Ser Phe Gin Leu Met Leu Cys Val 
95 100 105 110 

CTT GAC TAT TTT ATT AAA CTC TCA CCT CCC ATG TTG CTC AAA GAA CCA 384 
Leu Asp Tyr Phe He Lys Leu Ser Pro Pro Met Leu Leu Lys Glu Pro 
115 120 125 



TAT AAA ACA GCT GTT ATA CCC ATT AAT GGT TCA CCT CGA ACA CCC AGG 
Tyr Lys Thr Ala Val He Pro He Asn Gly Ser Pro Arg Thr Pro Arg 
130 135 140 

CGA GGT CAG AAC AGG AGT GCA CGG ATA GCA AAA CAA CTA GAA AAT GAT 
Arg Gly Gin Asn Arg Ser Ala Arg He Ala Lys Gin Leu Glu Asn Asp 
145 150 155 



GAG GTG AAA AAT GTT TAT TTC AAA AAT TTT ATA CCT TTT ATG AAT TCT 
Glu Val Lys Asn Val Tyr Phe Lys Asn Phe He Pro Phe Met Asn Ser 
175 180 las 190 



AAA CGA TAC GAA GAA ATT TAT CTT AAA AAT AAA GAT CTA GAT GCA AGA 
Lys Arg Tyr Glu Glu He Tyr Leu Lys Asn Lys Asp Leu Asp Ala Arg 
210 215 220 



432 



480 



ACA AGA ATT ATT GAA GTT CTC TGT AAA GAA CAT GAA TGT AAT ATA GAT 528 
Thr Arg He He Glu Val Leu Cys Lys Glu His Glu Cys Asn He Asp 
160 165 170 



576 



CTT GGA CTT GTA ACA TCT AAT GGA CTT CCA GAG GTT GAA AAT CTT TCT 624 
Leu Gly Leu Val Thr Ser Asn Gly Leu Pro Glu Val Glu Asn Leu Ser 
195 200 205 



672 



TTA TTT TTG GAT CAT GAT AAA ACT CTT CAG ACT GAT TCT ATA GAC AGT 72 0 

Leu Phe Leu Asp His Asp Lys Thr Leu Gin Thr Asp Ser He Asp Ser 
225 230 235 

TTT GAA ACA CAG AGA ACA CCA CGA AAA AGT AAC CTT GAT GAA GAG GTG 768 
Phe Glu Thr Gin Arg Thr Pro Arg Lys Ser Asn Leu Asp Glu Glu Val 
240 245 250 

AAT GTA ATT CCT CCA CAC ACT CCA GTT AGG ACT GTT ATG AAC ACT ATC 816 
Asn Val He Pro Pro His Thr Pro Val Arg Thr Val Met Asn Thr He 
255 260 265 270 

CAA CAA TTA ATG ATG ATT TTA AAT TCA GCA AGT GAT CAA CCT TCA GAA 864 
Gin Gin Leu Met Met He Leu Asn Ser Ala Ser Asp Gin Pro Ser Glu 
275 280 285 

AAT CTG ATT TCC TAT TTT AAC AAC TGC ACA GTG AAT CCA AAA GAA AGT 912 
Asn Leu lie Ser Tyr Phe Asn Asn Cys Thr Val Asn Pro Lys Glu Ser 
290 295 300 

ATA CTG AAA AGA GTG AAG GAT ATA GGA TAC ATC TTT AAA GAG AAA TTT 960 
He Leu Lys Arg Val Lys Asp He Gly Tyr He Phe Lys Glu Lys Phe 
305 310 315 

GCT AAA GCT GTG GGA CAG GGT TGT GTC GAA ATT GGA TCA CAG CGA TAC 1008 
Ala Lys Ala Val Gly Gin Gly Cys Val Glu He Gly Ser Gin Arg Tyr 
320 325 330 

AAA CTT GGA GTT CGC TTG TAT TAC CGA GTA ATG GAA TCC ATG CTT AAA 1056 
Lys Leu Gly Val Arg Leu Tyr Tyr Arg Val Met Glu Ser Met Leu Lys 
335 340 345 350 

TCA GAA GAA GAA CGA TTA TCC ATT CAA AAT TTT AGC AAA CTT CTG AAT 1104 
Ser Glu Glu Glu Arg Leu Ser He Gin Asn Phe Ser Lys Leu Leu Asn 
355 360 365 

GAC AAC ATT TTT CAT ATG TCT TTA TTG GCG TGC GCT CTT GAG GTT GTA 1152 
Asp Asn He Phe His Met Ser Leu Leu Ala Cys Ala Leu Glu Val Val 
37 0 375 380 

ATG GCC ACA TAT AGC AGA AGT ACA TCT CAG AAT CTT GAT TCT GGA ACA 12 00 

Met Ala Thr Tyr Ser Arg Ser Thr Ser Gin Asn Leu Asp Ser Gly Thr 
385 390 395 

GAT TTG TCT TTC CCA TGG ATT CTG AAT GTG CTT AAT TTA AAA GCC TTT 1248 
Asp Leu Ser Phe Pro Trp He Leu Asn Val Leu Asn Leu Lys Ala Phe 
400 405 410 

GAT TTT TAC AAA GTG ATC GAA AGT TTT ATC AAA GCA GAA GGC AAC TTG 12 96 

Asp Phe Tyr Lys Val He Glu Ser Phe He Lys Ala Glu Gly Asn Leu 
415 420 425 430 



ACA AGA GAA ATG ATA AAA CAT TTA GAA CGA TGT GAA CAT CGA ATC ATG 1344 

Thr Arg Glu Met He Lys His Leu Glu Arg Cys Glu His Arg He Met 
435 440 445 

GAA TCC CTT GCA TGG CTC TCA GAT TCA CCT TTA TTT GAT CTT ATT AAA 1392 

Glu Ser Leu Ala Trp Leu Ser Asp Ser Pro Leu Phe Asp Leu He Lys 
450 455 460 



CAA TCA AAG GAC CGA GAA GGA CCA ACT GAT CAC CTT GAA TCT GCT TGT 
Gin Ser Lys Asp Arg Glu Gly Pro Thr Asp His Leu Glu Ser Ala Cys 
465 470 475 



CAG AAG CCA TTG AAA TCT ACC TCT CTT TCA CTG TTT TAT AAA AAA GTG 
Gin Lys Pro Leu Lys Ser Thr Ser Leu Ser Leu Phe Tyr Lys Lys Val 
530 535 540 

TAT CGG CTA GCC TAT CTC CGG CTA AAT ACA CTT TGT GAA CGC CTT CTG 
Tyr Arg Leu Ala Tyr Leu Arg Leu Asn Thr Leu Cys Glu Arg Leu Leu 
545 550 555 

TCT GAG CAC CCA GAA TTA GAA CAT ATC ATC TGG ACC CTT TTC CAG CAC 
Ser Glu His Pro Glu Leu Glu His He He Trp Thr Leu Phe Gin His 
560 565 570 

ACC CTG CAG AAT GAG TAT GAA CTC ATG AGA GAC AGG CAT TTG GAC CAA 
Thr Leu Gin Asn Glu Tyr Glu Leu Met Arg Asp Arg His Leu Asp Gin 
575 580 585 590 



1440 



CCT CTT AAT CTT CCT CTC CAG AAT AAT CAC ACT GCA GCA GAT ATG TAT 1488 
Pro Leu Asn Leu Pro Leu Gin Asn Asn His Thr Ala Ala Asp Met Tyr 
480 485 490 

CTT TCT CCT GTA AGA TCT CCA AAG AAA AAA GGT TCA ACT ACG CGT GTA 1536 
Leu Ser Pro Val Arg Ser Pro Lys Lys Lys Gly Ser Thr Thr Arg Val 
495 500 505 510 

AAT TCT ACT GCA AAT GCA GAG ACA CAA GCA ACC TCA GCC TTC CAG ACC 1584 
Asn Ser Thr Ala Asn Ala Glu Thr Gin Ala Thr Ser Ala Phe Gin Thr 
515 520 525 



1632 



1680 



1728 



1776 



ATT ATG ATG TGT TCC ATG TAT GGC ATA TGC AAA GTG AAG AAT ATA GAC 1824 
He Met Met Cys Ser Met Tyr Gly He Cys Lys Val Lys Asn He Asp 
595 600 605 

CTT AAA TTC AAA ATC ATT GTA ACA GCA TAC AAG GAT CTT CCT CAT GCT 1872 
Leu Lys Phe Lys He He Val Thr Ala Tyr Lys Asp Leu Pro His Ala 
610 615 620 

GTT CAG GAG ACA TTC AAA CGT GTT TTG ATC AAA GAA GAG GAG TAT GAT 1920 
Val Gin Glu Thr Phe Lys Arg Val Leu He Lys Glu Glu Glu Tyr Asp 
625 630 635 



TCT ATT ATA GTA TTC TAT AAC TCG GTC TTC ATG CAG AGA CTG AAA ACA 1968 
Ser He He Val Phe Tyr Asn Ser Val Phe Met Gin Arg Leu Lys Thr 
640 645 650 

AAT ATT TTG CAG TAT GCT TCC ACC AGG CCC CCT ACC TTG TCA CCA ATA 2 016 

Asn He Leu Gin Tyr Ala Ser Thr Arg Pro Pro Thr Leu Ser Pro lie 
655 660 665 670 



CCT CAC ATT CCT CGA AGC CCT TAC AAG TTT CCT AGT TCA CCC TTA CGG 
Pro His He Pro Arg Ser Pro Tyr Lys Phe Pro Ser Ser Pro Leu Arg 
675 680 685 

ATT CCT GGA GGG AAC ATC TAT ATT TCA CCC CTG AAG AGT CCA TAT AAA 
He Pro Gly Gly Asn He Tyr He Ser Pro Leu Lys Ser Pro Tyr Lys 
59 0 695 700 

ATT TCA GAA GGT CTG CCA ACA CCA ACA AAA ATG ACT CCA AGA TCA AGA 
He Ser Glu Gly Leu Pro Thr Pro Thr Lys Met Thr Pro Arg Ser Arg 
705 710 715 



AAA ATA AAT CAG ATG GTA TGT AAC AGC GAC CGT GTG CTC AAA AGA AGT 
Lys He Asn Gin Met Val Cys Asn Ser Asp Arg Val Leu Lys Arg Ser 
735 740 745 750 

GCT GAA GGA AGC AAC CCT CCT AAA CCA CTG AAA AAA CTA CGC TTT GAT 
Ala Glu Gly Ser Asn Pro Pro Lys Pro Leu Lys Lys Leu Arg Phe Asp 
755 760 765 

ATT GAA GGA TCA GAT GAA GCA GAT GGA AGT AAA CAT CTC CCA GGA GAG 
He Glu Gly Ser Asp Glu Ala Asp Gly Ser Lys His Leu Pro Gly Glu 
770 775 780 



ATG CAA AAG CAG AAA ATG AAT GAT AGC ATG GAT ACC TCA AAC AAG GAA 
Met Gin Lys Gin Lys Met Asn Asp Ser Met Asp Thr Ser Asn Lys Glu 
800 805 810 

GAG AAA TGAGGATCTC AGGACCTTGG TGGACACTGT GTACACCTCT GGATTCATTG 

Glu Lys 

815 

TCTCTCACAG ATGTGACTGT ATAACTTTCC CAGGTTCTGT TTATGGCCAC ATTTAATATC 
TTCAGCTCTT TTTGTGGATA TAAAATGTGC AGATGCAATT GTTTGGGTGA TTCCTAAGCC 
ACTTGAAATG TTAGTCATTG TTATTTATAC AAGATTGAAA ATCTTGTGTA AATCCTGCCA 



2064 



2112 



2160 



ATC TTA GTA TCA ATT GGT GAA TCA TTC GGG ACT TCT GAG AAG TTC CAG 2208 
He Leu Val Ser He Gly Glu Ser Phe Gly Thr Ser Glu Lys Phe Gin 
720 725 730 



2256 



2304 



2352 



TCC AAA TTT CAG CAG AAA CTG GCA GAA ATG ACT TCT ACT CGA ACA CGA 2400 
Ser Lys Phe Gin Gin Lys Leu Ala Glu Met Thr Ser Thr Arg Thr Arg 
785 790 795 



2448 

2504 

2564 
2624 
2684 



TTTAAAAAGT TGTAGCAGAT TGTTTCCTCT TCCAAAGTAA AATTGCTGTG CTTTATGGAT 2744 

AGTAAGAATG GCCCTAGAGT GGGAGTCCTG ATAACCCAGG CCTGTCTGAC TACTTTGCCT 2 804 

TCTTTTGTAG CATATAGGTG ATGTTTGCTC TTGTTTTTAT TAATTTATAT GTATATTTTT 2864 

TTAATTTAAC ATGAACACCC TTAGAAAATG TGTCCTATCT ATCTTCCAAA TGCAATTTGA 2924 

TTGACTGCCC ATTCACCAAA ATTATCCTGA ACTCTTCTGC AAAAATGGAT ATTATTAGAA 2984 

ATTAGAAAAA AATTACTAAT TTTACACATT AGATTTTATT TTACTATTGG AATCTGATAT 3 044 

ACTGTGTGCT TGTTTTATAA AATTTTGCTT TTAATTAAAT AAAAGCTGGA AGCAAAGTAT 3104 

AAC CATATGA TACTATCATA CTACTGAAAC AGATTTCATA CCTCAGAATG TAAAAGAACT 3164 

TACTGATTAT TTTCTTCATC CAACTTATGT TTTTAAATGA GGATTATTGA TAGT 3218 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 816 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Met Ser Phe Thr Phe Thr Glu Leu Gin Lys Asn He Glu He Ser Val 
15 10 15 

His Lys Phe Phe Asn Leu Leu Lys Glu He Asp Thr Ser Thr Lys Val 
20 25 30 

Asp Asn Ala Met Ser Arg Leu Leu Lys Lys Tyr Asp Val Leu Phe Ala 
35 40 45 

Leu Phe Ser Lys Leu Glu Arg Thr Cys Glu Leu He Tyr Leu Thr Gin 
5 0 55 60 

Pro Ser Ser Ser He Ser Thr Glu He Asn Ser Ala Leu Val Leu Lys 
65 70 75 80 

Val Ser Trp He Thr Phe Leu Leu Ala Lys Gly Glu Val Leu Gin Met 
85 90 95 

Glu Asp Asp Leu Val He Ser Phe Gin Leu Met Leu Cys Val Leu Asp 
100 105 110 

Tyr Phe He Lys Leu Ser Pro Pro Met Leu Leu Lys Glu Pro Tyr Lys 
115 120 125 



Thr Ala Val lie Pro lie Asn Gly Ser Pro Arg Thr Pro Arg Arg Gly 
130 135 140 

Gin Asn Arg Ser Ala Arg He Ala Lys Gin Leu Glu Asn Asp Thr Arg 
145 15 ° 155 160 

He He Glu Val Leu Cys Lys Glu His Glu Cys Asn He Asp Glu Val 
165 170 175 

Lys Asn Val Tyr Phe Lys Asn Phe He Pro Phe Met Asn Ser Leu Gly 
180 185 190 

Leu Val Thr Ser Asn Gly Leu Pro Glu Val Glu Asn Leu Ser Lys Arg 
195 200 205 

Tyr Glu Glu He Tyr Leu Lys Asn Lys Asp Leu Asp Ala Arg Leu Phe 
210 215 220 

Leu Asp His Asp Lys Thr Leu Gin Thr Asp Ser He Asp Ser Phe Glu 

225 23 0 235 240 

Thr Gin Arg Thr Pro Arg Lys Ser Asn Leu Asp Glu Glu Val Asn Val 
245 250 255 

He Pro Pro His Thr Pro Val Arg Thr Val Met Asn Thr He Gin Gin 

260 265 270 

Leu Met Met He Leu Asn Ser Ala Ser Asp Gin Pro Ser Glu Asn Leu 
275 280 285 

He Ser Tyr Phe Asn Asn Cys Thr Val Asn Pro Lys Glu Ser He Leu 
290 295 300 

Lys Arg Val Lys Asp He Gly Tyr He Phe Lys Glu Lys Phe Ala Lys 
305 310 315 320 

Ala Val Gly Gin Gly Cys Val Glu He Gly Ser Gin Arg Tyr Lys Leu 

325 330 335 

Gly Val Arg Leu Tyr Tyr Arg Val Met Glu Ser Met Leu Lys Ser Glu 

340 345 350 

Glu Glu Arg Leu Ser He Gin Asn Phe Ser Lys Leu Leu Asn Asp Asn 
355 360 365 

He Phe His Met Ser Leu Leu Ala Cys Ala Leu Glu Val Val Met Ala 
370 375 380 

Thr Tyr Ser Arg Ser Thr Ser Gin Asn Leu Asp Ser Gly Thr Asp Leu 
385 390 395 400 

Ser Phe Pro Trp He Leu Asn Val Leu Asn Leu Lys Ala Phe Asp Phe 
405 410 415 



Tyr Lys Val He Glu Ser Phe He Lys Ala Glu Gly Asn Leu Thr Arg 
420 425 430 

Glu Met He Lys His Leu Glu Arg Cys Glu His Arg He Met Glu Ser 
435 440 445 

Leu Ala Trp Leu Ser Asp Ser Pro Leu Phe Asp Leu He Lys Gin Ser 
450 455 460 

Lys Asp Arg Glu Gly Pro Thr Asp His Leu Glu Ser Ala Cys Pro Leu 
465 470 475 480 

Asn Leu Pro Leu Gin Asn Asn His Thr Ala Ala Asp Met Tyr Leu Ser 
485 490 495 

Pro Val Arg Ser Pro Lys Lys Lys Gly Ser Thr Thr Arg Val Asn Ser 
500 505 510 

Thr Ala Asn Ala Glu Thr Gin Ala Thr Ser Ala Phe Gin Thr Gin Lys 
515 520 525 

Pro Leu Lys Ser Thr Ser Leu Ser Leu Phe Tyr Lys Lys Val Tyr Arg 
530 535 540 

Leu Ala Tyr Leu Arg Leu Asn Thr Leu Cys Glu Arg Leu Leu Ser Glu 
545 550 555 560 

His Pro Glu Leu Glu His He He Trp Thr Leu Phe Gin His Thr Leu 
565 570 575 

Gin Asn Glu Tyr Glu Leu Met Arg Asp Arg His Leu Asp Gin He Met 
580 585 590 

Met Cys Ser Met Tyr Gly He Cys Lys Val Lys Asn He Asp Leu Lys 
595 600 605 

Phe Lys He He Val Thr Ala Tyr Lys Asp Leu Pro His Ala Val Gin 
610 615 620 

Glu Thr Phe Lys Arg Val Leu He Lys Glu Glu Glu Tyr Asp Ser He 
525 630 635 640 

He Val Phe Tyr Asn Ser Val Phe Met Gin Arg Leu Lys Thr Asn He 
645 550 655 

Leu Gin Tyr Ala Ser Thr Arg Pro Pro Thr Leu Ser Pro He Pro His 
660 665 670 

He Pro Arg Ser Pro Tyr Lys Phe Pro Ser Ser Pro Leu Arg He Pro 
675 680 685 

Gly Gly Asn He Tyr He Ser Pro Leu Lys Ser Pro Tyr Lys He Ser 
590 695 700 



Glu Gly Leu Pro Thr Pro Thr Lys Met Thr Pro Arg Ser Arg lie Leu 
705 710 715 720 



Val Ser lie Gly Glu Ser Phe Gly Thr Ser Glu Lys Phe Gin Lys lie 
725 730 735 

Asn Gin Met Val Cys Asn Ser Asp Arg Val Leu Lys Arg Ser Ala Glu 
740 745 750 

Gly Ser Asn Pro Pro Lys Pro Leu Lys Lys Leu Arg Phe Asp lie Glu 
755 760 765 

Gly Ser Asp Glu Ala Asp Gly Ser Lys His Leu Pro Gly Glu Ser Lys 
770 775 780 

Phe Gin Gin Lys Leu Ala Glu Met Thr Ser Thr Arg Thr Arg Met Gin 
785 790 795 800 



Lys Gin Lys Met Asn Asp Ser Met Asp Thr Ser Asn Lys Glu Glu Lys 
805 810 815 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 285 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

CTCGAGCAAT GGGCGTGATA GCGGTTTGAC TCACGGGGAT TTCCAAGTCT CCACCCCATT 60 

GACGTCAATG GGAGTTTGTT TTGGCACCAA AATCAACGGG ACTTTCCAAA ATGTCGTAAC 120 

AACTCCGCCC CATTGACGCA AATGGGCGGT AGGCGTGTAC GGTGGGAGGT CTATATAAGC 180 

AGAGCTCGTT TAGTGAACCG TCAGATCGCC TGGAGACGCC ATCCACGCTG TTTTGACCTC 240 

CATAGAAGAC ACCGGGACCG ATCCAGCCTC CGCGGCCGCG AATTC 285 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS; single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



CCGCTCGAGC AATGGGCGTG GATAGCGG 



28 



(2) INFORMATION FOR SEQ ID NO : 7 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7 
CCGCTCGAGC AC CAAAATCA ACGGGA 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 
CCGCTCGAGC AACTCCGCCC CATTGAC 



(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
TAGACATATG AATTCGCGGC C 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CTAGAATTCG CTGTCTGCG 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 
GCTCTAGATG CAGTTGGACC TGGGAG 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 
CCCAAGCTTG CCGCCATGTC GTTCACTTTT AC 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS; single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 
GTCCAAGAGA ATTCATAAAA GG 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 



CCCAAGCTTG CCGCCATGGA GCAGGACAGC GGCCCGGAC 



(2) INFORMATION FOR SEQ ID NO:15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15: 
CCCAAGCTTG CCGCCATGGA TTTTACTGCA TTATGTCAG 



(2) INFORMATION FOR SEQ ID NO : 16 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
CCCAAGCTTG CCGCCATGGA GAAAGTTTCA TCTTGTGAT 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
CCCAAGCTTG CCGCCATGCT GTGGGGAATC TGTATCTTT 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:18: 
CCCAAGCTTG CCGCCATGTC AAGACTGTTG AAGAAG 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19 
GCGCCTGAGG ACCTAGATGA GATGTCGTTC 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20 
GCGGTTAACC CTAGATGAGA TGTCGTTCAC T 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21 
CCCAAGCTTG CCGTCATGCC GCCCAAAACC CCCCGA 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 
CTCACCTAGG TCAACTGCTG CAAT 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 
GTTGACCTAG GTGATATGTC GTTC 



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 
GCGCCTAGGA TCTACTGAAA TAAATTCTGC A 



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25 
CCCGATATCA ACTGCTGGGT TGTGTCAAAT A 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26 
CCCGAATTCG TTTTATATGG TTCTTTGAGC AA 



(2) INFORMATION FOR SEQ ID NO: 27: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME /KEY: modif ied_base 
<B) LOCATION: 4.. 5 

(D) OTHER INFORMATION: /note= "R=A or 6" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
GCCRCCAUGG 10 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3455 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 7.. 2691 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 

GCCGTC ATG CAG GAC AGC GGC CCG GAG GAC CTG CCT CTC GTC AGG CTT 48 
Met Gin Asp Ser Gly Pro Glu Asp Leu Pro Leu Val Arg Leu 



1 



5 



10 



GAG TTT GAA GAA ACA GAA GAA CCT GAT TTT ACT GCA TTA TGT CAG AAA 
Glu Phe Glu Glu Thr Glu Glu Pro Asp Phe Thr Ala Leu Cys Gin Lys 
15 20 25 30 



96 



TTA AAG ATA CCA GAT CAT GTC AGA GAG AGA GCT TGG TTA ACT TGG GAG 
Leu Lys lie Pro Asp His Val Arg Glu Arg Ala Trp Leu Thr Trp Glu 
35 40 45 



144 



AAA GTT TCA TCT GTG GAT GGA GTA TTG GGA GGT TAT ATT CAA AAG AAA 
Lys Val Ser Ser Val Asp Gly Val Leu Gly Gly Tyr He Gin Lys Lys 
50 55 60 



192 



AAG GAA CTG TGG GGA ATC TGT ATC TTT ATT GCA GCA GTT GAC CTA GAT 
Lys Glu Leu Trp Gly He Cys He Phe He Ala Ala Val Asp Leu Asp 
65 70 75 



240 



GAG ATG TCG TTC ACT TTT ACT GAG CTA CAG AAA AAC ATA GAA ATC AGT 28 8 

Glu Met Ser Phe Thr Phe Thr Glu Leu Gin Lys Asn He Glu He Ser 
80 85 90 

GTC CAT AAA TTC TTT AAC TTA CTA AAA GAA ATT GAT ACC AGT ACC AAA 336 
Val His Lys Phe Phe Asn Leu Leu Lys Glu He Asp Thr Ser Thr Lys 
95 100 105 110 

GTT GAT AAT GCT ATG TCA AGA CTG TTG AAG AAG TAT GAT GTA TTG TTT 384 
Val Asp Asn Ala Met Ser Arg Leu Leu Lys Lys Tyr Asp Val Leu Phe 
115 120 125 

GCA CTC TTC AGC AAA TTG GAA AGG ACA TGT GAA CTT ATA TAT TTG ACA 432 
Ala Leu Phe Ser Lys Leu Glu Arg Thr Cys Glu Leu He Tyr Leu Thr 
130 135 140 

CAA CCC AGC AGT TCG ATA TCT ACT GAA ATA AAT TCT GCA TTG GTG CTA 480 
Gin Pro Ser Ser Ser He Ser Thr Glu He Asn Ser Ala Leu Val Leu 
145 150 155 

AAA GTT TCT TGG ATC ACA TTT TTA TTA GCT AAA GGG GAA GTA TTA CAA 528 
Lys Val Ser Trp He Thr Phe Leu Leu Ala Lys Gly Glu Val Leu Gin 
160 165 170 

ATG GAA GAT GAT CTG GTG ATT TCA TTT CAG TTA ATG CTA TGT GTC CTT 576 
Met Glu Asp Asp Leu Val He Ser Phe Gin Leu Met Leu Cys Val Leu 
175 180 185 190 

GAC TAT TTT ATT AAA CTC TCA CCT CCC ATG TTG CTC AAA GAA CCA TAT 624 
Asp Tyr Phe He Lys Leu Ser Pro Pro Met Leu Leu Lys Glu Pro Tyr 
195 200 205 

AAA ACA GCT GTT ATA CCC ATT AAT GGT TCA CCT CGA ACA CCC AGG CGA 672 
Lys Thr Ala Val He Pro He Asn Gly Ser Pro Arg Thr Pro Arg Arg 
210 215 220 

GGT CAG AAC AGG AGT GCA CGG ATA GCA AAA CAA CTA GAA AAT GAT ACA 720 
Gly Gin Asn Arg Ser Ala Arg He Ala Lys Gin Leu Glu Asn Asp Thr 
225 230 235 

AGA ATT ATT GAA GTT CTC TGT AAA GAA CAT GAA TGT AAT ATA GAT GAG 768 
Arg He He Glu Val Leu Cys Lys Glu His Glu Cys Asn He Asp Glu 
240 245 250 

GTG AAA AAT GTT TAT TTC AAA AAT TTT ATA CCT TTT ATG AAT TCT CTT 816 
Val Lys Asn Val Tyr Phe Lys Asn Phe He Pro Phe Met Asn Ser Leu 
255 260 265 270 

GGA CTT GTA ACA TCT AAT GGA CTT CCA GAG GTT GAA AAT CTT TCT AAA 864 
Gly Leu Val Thr Ser Asn Gly Leu Pro Glu Val Glu Asn Leu Ser Lys 
275 280 285 



CGA TAC GAA GAA ATT TAT CTT AAA AAT AAA GAT CTA GAT GCA AGA TTA 912 
Arg Tyr Glu Glu lie Tyr Leu Lys Asn Lys Asp Leu Asp Ala Arg Leu 
290 295 300 

TTT TTG GAT CAT GAT AAA ACT CTT CAG ACT GAT TCT ATA GAC AGT TTT 960 
Phe Leu Asp His Asp Lys Thr Leu Gin Thr Asp Ser lie Asp Ser Phe 
305 310 315 

GAA ACA CAG AGA ACA CCA CGA AAA AGT AAC CTT GAT GAA GAG GTG AAT 1008 
Glu Thr Gin Arg Thr Pro Arg Lys Ser Asn Leu Asp Glu Glu Val Asn 
320 325 330 

GTA ATT CCT CCA CAC ACT CCA GTT AGG ACT GTT ATG AAC ACT ATC CAA 1056 
Val He Pro Pro His Thr Pro Val Arg Thr Val Met Asn Thr He Gin 
335 340 345 350 

CAA TTA ATG ATG ATT TTA AAT TCA GCA AGT GAT CAA CCT TCA GAA AAT 1104 
Gin Leu Met Met He Leu Asn Ser Ala Ser Asp Gin Pro Ser Glu Asn 
355 360 365 

CTG ATT TCC TAT TTT AAC AAC TGC ACA GTG AAT CCA AAA GAA AGT ATA 1152 
Leu He Ser Tyr Phe Asn Asn Cys Thr Val Asn Pro Lys Glu Ser He 
370 375 380 

CTG AAA AGA GTG AAG GAT ATA GGA TAC ATC TTT AAA GAG AAA TTT GCT 1200 
Leu Lys Arg Val Lys Asp He Gly Tyr He Phe Lys Glu Lys Phe Ala 
385 390 395 

AAA GCT GTG GGA CAG GGT TGT GTC GAA ATT GGA TCA CAG CGA TAC AAA 1248 
Lys Ala Val Gly Gin Gly Cys Val Glu He Gly Ser Gin Arg Tyr Lys 
400 405 410 

CTT GGA GTT CGC TTG TAT TAC CGA GTA ATG GAA TCC ATG CTT AAA TCA 1296 
Leu Gly Val Arg Leu Tyr Tyr Arg Val Met Glu Ser Met Leu Lys Ser 
415 420 425 430 

GAA GAA GAA CGA TTA TCC ATT CAA AAT TTT AGC AAA CTT CTG AAT GAC 1344 
Glu Glu Glu Arg Leu Ser He Gin Asn Phe Ser Lys Leu Leu Asn Asp 
435 440 445 

AAC ATT TTT CAT ATG TCT TTA TTG GCG TGC GCT CTT GAG GTT GTA ATG 1392 
Asn He Phe His Met Ser Leu Leu Ala Cys Ala Leu Glu Val Val Met 
450 455 460 

GCC ACA TAT AGC AGA AGT ACA TCT CAG AAT CTT GAT TCT GGA ACA GAT 1440 
Ala Thr Tyr Ser Arg Ser Thr Ser Gin Asn Leu Asp Ser Gly Thr Asp 
465 470 475 

TTG TCT TTC CCA TGG ATT CTG AAT GTG CTT AAT TTA AAA GCC TTT GAT 1488 
Leu Ser Phe Pro Trp He Leu Asn Val Leu Asn Leu Lys Ala Phe Asp 
480 485 490 



TTT TAC AAA GTG ATC GAA AGT TTT ATC AAA GCA GAA GGC AAC TTG ACA 1536 
Phe Tyr Lys Val lie Glu Ser Phe He Lys Ala Glu Gly Asn Leu Thr 
495 500 505 510 

AGA GAA ATG ATA AAA CAT TTA GAA CGA TGT GAA CAT CGA ATC ATG GAA 1584 
Arg Glu Met He Lys His Leu Glu Arg Cys Glu His Arg He Met Glu 
515 520 525 

TCC CTT GCA TGG CTC TCA GAT TCA CCT TTA TTT GAT CTT ATT AAA CAA 1632 
Ser Leu Ala Trp Leu Ser Asp Ser Pro Leu Phe Asp Leu He Lys Gin 
530 535 540 

TCA AAG GAC CGA GAA GGA CCA ACT GAT CAC CTT GAA TCT GCT TGT CCT 1680 
Ser Lys Asp Arg Glu Gly Pro Thr Asp His Leu Glu Ser Ala Cys Pro 
545 550 555 

CTT AAT CTT CCT CTC CAG AAT AAT CAC ACT GCA GCA GAT ATG TAT CTT 1728 
Leu Asn Leu Pro Leu Gin Asn Asn His Thr Ala Ala Asp Met Tyr Leu 
560 565 570 

TCT CCT GTA AGA TCT CCA AAG AAA AAA GGT TCA ACT ACG CGT GTA AAT 1776 
Ser Pro Val Arg Ser Pro Lys Lys Lys Gly Ser Thr Thr Arg Val Asn 
575 580 585 590 

TCT ACT GCA AAT GCA GAG ACA CAA GCA ACC TCA GCC TTC CAG ACC CAG 1824 
Ser Thr Ala Asn Ala Glu Thr Gin Ala Thr Ser Ala Phe Gin Thr Gin 
595 600 605 

AAG CCA TTG AAA TCT ACC TCT CTT TCA CTG TTT TAT AAA AAA GTG TAT 1872 
Lys Pro Leu Lys Ser Thr Ser Leu Ser Leu Phe Tyr Lys Lys Val Tyr 
610 615 620 

CGG CTA GCC TAT CTC CGG CTA AAT ACA CTT TGT GAA CGC CTT CTG TCT 1920 
Arg Leu Ala Tyr Leu Arg Leu Asn Thr Leu Cys Glu Arg Leu Leu Ser 
625 630 635 

GAG CAC CCA GAA TTA GAA CAT ATC ATC TGG ACC CTT TTC CAG CAC ACC 1968 
Glu His Pro Glu Leu Glu His He He Trp Thr Leu Phe Gin His Thr 
640 645 650 

CTG CAG AAT GAG TAT GAA CTC ATG AGA GAC AGG CAT TTG GAC CAA ATT 2016 
Leu Gin Asn Glu Tyr Glu Leu Met Arg Asp Arg His Leu Asp Gin He 
655 660 665 670 

ATG ATG TGT TCC ATG TAT GGC ATA TGC AAA GTG AAG AAT ATA GAC CTT 2064 
Met Met Cys Ser Met Tyr Gly He Cys Lys Val Lys Asn He Asp Leu 
675 680 685 

AAA TTC AAA ATC ATT GTA ACA GCA TAC AAG GAT CTT CCT CAT GCT GTT 2112 
Lys Phe Lys He He Val Thr Ala Tyr Lys Asp Leu Pro His Ala Val 
690 695 700 



CAG GAG ACA TTC AAA CGT GTT TTG ATC AAA GAA GAG GAG TAT GAT TCT 
Gin Glu Thr Phe Lys Arg Val Leu lie Lys Glu Glu Glu Tyr Asp Ser 
705 710 715 



2160 



ATT ATA GTA TTC TAT AAC TCG GTC TTC ATG CAG AGA CTG AAA ACA AAT 2208 
lie He Val Phe Tyr Asn Ser Val Phe Met Gin Arg Leu Lys Thr Asn 
720 725 730 

ATT TTG CAG TAT GCT TCC ACC AGG CCC CCT ACC TTG TCA CCA ATA CCT 2256 
He Leu Gin Tyr Ala Ser Thr Arg Pro Pro Thr Leu Ser Pro lie Pro 
735 740 745 750 

CAC ATT CCT CGA AGC CCT TAC AAG TTT CCT AGT TCA CCC TTA CGG ATT 23 04 

His He Pro Arg Ser Pro Tyr Lys Phe Pro Ser Ser Pro Leu Arg He 
755 760 765 

CCT GGA GGG AAC ATC TAT ATT TCA CCC CTG AAG AGT CCA TAT AAA ATT 2352 
Pro Gly Gly Asn He Tyr He Ser Pro Leu Lys Ser Pro Tyr Lys He 
770 775 780 

TCA GAA GGT CTG CCA ACA CCA ACA AAA ATG ACT CCA AGA TCA AGA ATC 2400 
Ser Glu Gly Leu Pro Thr Pro Thr Lys Met Thr Pro Arg Ser Arg He 
785 790 795 

TTA GTA TCA ATT GGT GAA TCA TTC GGG ACT TCT GAG AAG TTC CAG AAA 2448 
Leu Val Ser He Gly Glu Ser Phe Gly Thr Ser Glu Lys Phe Gin Lys 
800 805 810 

ATA AAT CAG ATG GTA TGT AAC AGC GAC CGT GTG CTC AAA AGA AGT GCT 24 96 

He Asn Gin Met Val Cys Asn Ser Asp Arg Val Leu Lys Arg Ser Ala 
815 820 825 830 

GAA GGA AGC AAC CCT CCT AAA CCA CTG AAA AAA CTA CGC TTT GAT ATT 2544 
Glu Gly Ser Asn Pro Pro Lys Pro Leu Lys Lys Leu Arg Phe Asp He 
835 840 845 

GAA GGA TCA GAT GAA GCA GAT GGA AGT AAA CAT CTC CCA GGA GAG TCC 2592 
Glu Gly Ser Asp Glu Ala Asp Gly Ser Lys His Leu Pro Gly Glu Ser 
850 855 860 

AAA TTT CAG CAG AAA CTG GCA GAA ATG ACT TCT ACT CGA ACA CGA ATG 2640 
Lys Phe Gin Gin Lys Leu Ala Glu Met Thr Ser Thr Arg Thr Arg Met 
865 870 875 

CAA AAG CAG AAA ATG AAT GAT AGC ATG GAT ACC TCA AAC AAG GAA GAG 2688 
Gin Lys Gin Lys Met Asn Asp Ser Met Asp Thr Ser Asn Lys Glu Glu 
880 885 890 

AAA TGAGGATCTC AGGACCTTGG TGGACACTGT GTACACCTCT GGATTCATTG 2 741 

Lys 

895 

TCTCTCACAG ATGTGACTGT ATAACTTTCC CAGGTTCTGT TTATGGCCAC ATTTAATATC 2801 



TTCAGCTCTT TTTGTGGATA TAAAATGTGC AGATGCAATT GTTTGGGTGA TTCCTAAGCC 2 861 

ACTTGAAATG TTAGTCATTG TTATTTATAC AAGATTGAAA ATCTTGTGTA AATCCTGCCA 2 921 

TTTAAAAAGT TGTAGCAGAT TGTTTCCTCT TCCAAAGTAA AATTGCTGTG CTTTATGGAT 2 981 

AGTAAGAATG GCCCTAGAGT GGGAGTCCTG ATAACCCAGG CCTGTCTGAC TACTTTGCCT 3041 

TCTTTTGTAG CATATAGGTG ATGTTTGCTC TTGTTTTTAT TAATTTATAT GTATATTTTT 3101 

TTAATTTAAC ATGAACACCC TTAGAAAATG TGTCCTATCT ATCTTCCAAA TGCAATTTGA 3161 

TTGACTGCCC ATTCACCAAA ATTATCCTGA ACTCTTCTGC AAAAATGGAT ATTATTAGAA 3221 

ATTAGAAAAA AATTACTAAT TTTACACATT AGATTTTATT TTACTATTGG AATCTGATAT 32 81 

ACTGTGTGCT TGTTTTATAA AATTTTGCTT TTAATTAAAT AAAAGCTGGA AGCAAAGTAT 3341 

AACCATATGA TACTATCATA CTACTGAAAC AGATTTCATA CCTCAGAATG TAAAAGAACT 3401 

TACTGATTAT TTTCTTCATC CAACTTATGT TTTTAAATGA GGATTATTGA TAGT 3455 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 895 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

Met Gin Asp Ser Gly Pro Glu Asp Leu Pro Leu Val 
15 10 

Glu Glu Thr Glu Glu Pro Asp Phe Thr Ala Leu Cys 
20 25 

lie Pro Asp His Val Arg Glu Arg Ala Trp Leu Thr 
35 40 

Ser Ser Val Asp Gly Val Leu Gly Gly Tyr lie Gin 
50 55 60 

Leu Trp Gly lie Cys lie Phe lie Ala Ala Val Asp 
65 70 75 

Ser Phe Thr Phe Thr Glu Leu Gin Lys Asn lie Glu 
85 90 

Lys Phe Phe Asn Leu Leu Lys Glu lie Asp Thr Ser 
100 105 



Arg Leu Glu Phe 
15 

Gin Lys Leu Lys 
30 

Trp Glu Lys Val 
45 

Lys Lys Lys Glu 



Leu Asp Glu Met 
80 

He Ser Val His 
95 

Thr Lys Val Asp 
110 



Asn Ala Met Ser Arg Leu Leu Lys Lys Tyr Asp Val Leu Phe Ala Leu 
115 120 125 



Phe Ser Lys Leu Glu 
130 

Ser Ser Ser lie Ser 
145 

Ser Trp lie Thr Phe 
165 

Asp Asp Leu Val lie 
180 

Phe lie Lys Leu Ser 
195 

Ala Val He Pro He 
210 

Asn Arg Ser Ala Arg 
225 

He Glu Val Leu Cys 
245 

Asn Val Tyr Phe Lys 
260 

Val Thr Ser Asn Gly 
275 

Glu Glu He Tyr Leu 
290 

Asp His Asp Lys Thr 
305 

Gin Arg Thr Pro Arg 
325 

Pro Pro His Thr Pro 
340 

Met Met He Leu Asn 
355 

Ser Tyr Phe Asn Asn 
370 



Arg Thr Cys Glu Leu He 
135 

Thr Glu He Asn Ser Ala 
150 155 

Leu Leu Ala Lys Gly Glu 
170 

Ser Phe Gin Leu Met Leu 
185 

Pro Pro Met Leu Leu Lys 
200 

Asn Gly Ser Pro Arg Thr 
215 

He Ala Lys Gin Leu Glu 
230 235 

Lys Glu His Glu Cys Asn 
250 

Asn Phe He Pro Phe Met 
265 

Leu Pro Glu Val Glu Asn 
280 

Lys Asn Lys Asp Leu Asp 
295 

Leu Gin Thr Asp Ser He 
310 315 

Lys Ser Asn Leu Asp Glu 
330 

Val Arg Thr Val Met Asn 
345 

Ser Ala Ser Asp Gin Pro 
360 

Cys Thr Val Asn Pro Lys 
375 



Tyr Leu Thr Gin Pro 
140 

Leu Val Leu Lys Val 
160 

Val Leu Gin Met Glu 
175 

Cys Val Leu Asp Tyr 
190 

Glu Pro Tyr Lys Thr 
205 

Pro Arg Arg Gly Gin 
220 

Asn Asp Thr Arg He 
240 

He Asp Glu Val Lys 
255 

Asn Ser Leu Gly Leu 
270 

Leu Ser Lys Arg Tyr 
285 

Ala Arg Leu Phe Leu 
300 

Asp Ser Phe Glu Thr 
320 

Glu Val Asn Val He 
335 

Thr He Gin Gin Leu 
350 

Ser Glu Asn Leu He 
365 

Glu Ser He Leu Lys 
380 



Arg Val Lys Asp He Gly Tyr He Phe Lys Glu Lys Phe Ala Lys Ala 
385 390 395 400 



Val Gly Gin Gly Cys Val Glu He Gly Ser Gin Arg Tyr Lys Leu Gly 
405 410 415 



Val Arg Leu Tyr Tyr Arg Val Met Glu Ser Met Leu Lys Ser Glu Glu 
420 425 430 

Glu Arg Leu Ser He Gin Asn Phe Ser Lys Leu Leu Asn Asp Asn He 
435 440 445 

Phe His Met Ser Leu Leu Ala Cys Ala Leu Glu Val Val Met Ala Thr 
450 455 460 

Tyr Ser Arg Ser Thr Ser Gin Asn Leu Asp Ser Gly Thr Asp Leu Ser 
465 470 475 480 

Phe Pro Trp He Leu Asn Val Leu Asn Leu Lys Ala Phe Asp Phe Tyr 
485 490 495 

Lys Val He Glu Ser Phe He Lys Ala Glu Gly Asn Leu Thr Arg Glu 
500 505 510 

Met He Lys His Leu Glu Arg Cys Glu His Arg He Met Glu Ser Leu 
515 520 525 

Ala Trp Leu Ser Asp Ser Pro Leu Phe Asp Leu He Lys Gin Ser Lys 
530 535 540 

Asp Arg Glu Gly Pro Thr Asp His Leu Glu Ser Ala Cys Pro Leu Asn 
545 550 555 560 

Leu Pro Leu Gin Asn Asn His Thr Ala Ala Asp Met Tyr Leu Ser Pro 
565 570 575 

Val Arg Ser Pro Lys Lys Lys Gly Ser Thr Thr Arg Val Asn Ser Thr 
580 585 590 

Ala Asn Ala Glu Thr Gin Ala Thr Ser Ala Phe Gin Thr Gin Lys Pro 
595 600 605 

Leu Lys Ser Thr Ser Leu Ser Leu Phe Tyr Lys Lys Val Tyr Arg Leu 
610 615 620 

Ala Tyr Leu Arg Leu Asn Thr Leu Cys Glu Arg Leu Leu Ser Glu His 
625 630 635 640 

Pro Glu Leu Glu His He He Trp Thr Leu Phe Gin His Thr Leu Gin 
645 650 655 

Asn Glu Tyr Glu Leu Met Arg Asp Arg His Leu Asp Gin He Met Met 
660 665 670 



Cys Ser Met Tyr Gly He Cys Lys Val Lys Asn He Asp Leu Lys Phe 
675 680 685 



Lys He He Val Thr Ala Tyr Lys Asp Leu Pro His Ala Val Gin Glu 
690 695 700 



Thr Phe Lys Arg Val Leu He Lys Glu Glu Glu Tyr Asp Ser He He 
705 710 715 720 

Val Phe Tyr Asn Ser Val Phe Met Gin Arg Leu Lys Thr Asn He Leu 
725 730 735 

Gin Tyr Ala Ser Thr Arg Pro Pro Thr Leu Ser Pro He Pro His He 
740 745 750 

Pro Arg Ser Pro Tyr Lys Phe Pro Ser Ser Pro Leu Arg He Pro Gly 
755 760 765 

Gly Asn He Tyr He Ser Pro Leu Lys Ser Pro Tyr Lys He Ser Glu 
770 775 780 

Gly Leu Pro Thr Pro Thr Lys Met Thr Pro Arg Ser Arg He Leu Val 
78 5 790 795 800 

Ser He Gly Glu Ser Phe Gly Thr Ser Glu Lys Phe Gin Lys He Asn 
805 810 815 

Gin Met Val Cys Asn Ser Asp Arg Val Leu Lys Arg Ser Ala Glu Gly 
820 825 830 

Ser Asn Pro Pro Lys Pro Leu Lys Lys Leu Arg Phe Asp He Glu Gly 
835 840 845 

Ser Asp Glu Ala Asp Gly Ser Lys His Leu Pro Gly Glu Ser Lys Phe 
850 855 860 

Gin Gin Lys Leu Ala Glu Met Thr Ser Thr Arg Thr Arg Met Gin Lys 
865 870 875 880 

Gin Lys Met Asn Asp Ser Met Asp Thr Ser Asn Lys Glu Glu Lys 
885 890 895 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3392 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME / KEY : CDS 

(B) LOCATION: 7.. 2628 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 



GCCATC ATG GAT TTT ACT GCA TTA TGT CAG AAA TTA AAG ATA CCA GAT 48 
Met Asp Phe Thr Ala Leu Cys Gin Lys Leu Lys lie Pro Asp 
15 10 

CAT GTC AGA GAG AGA GCT TGG TTA ACT TGG GAG AAA GTT TCA TCT GTG 96 
His Val Arg Glu Arg Ala Trp Leu Thr Trp Glu Lys Val Ser Ser Val 
15 20 25 30 

GAT GGA GTA TTG GGA GGT TAT ATT CAA AAG AAA AAG GAA CTG TGG GGA 144 
Asp Gly Val Leu Gly Gly Tyr lie Gin Lys Lys Lys Glu Leu Trp Gly 
35 40 45 

ATC TGT ATC TTT ATT GCA GCA GTT GAC CTA GAT GAG ATG TCG TTC ACT 192 
lie Cys lie Phe lie Ala Ala Val Asp Leu Asp Glu Met Ser Phe Thr 
50 55 60 

TTT ACT GAG CTA CAG AAA AAC ATA GAA ATC AGT GTC CAT AAA TTC TTT 240 
Phe Thr Glu Leu Gin Lys Asn lie Glu lie Ser Val His Lys Phe Phe 
65 70 75 

AAC TTA CTA AAA GAA ATT GAT ACC AGT ACC AAA GTT GAT AAT GCT ATG 288 
Asn Leu Leu Lys Glu lie Asp Thr Ser Thr Lys Val Asp Asn Ala Met 
80 85 90 

TCA AGA CTG TTG AAG AAG TAT GAT GTA TTG TTT GCA CTC TTC AGC AAA 336 
Ser Arg Leu Leu Lys Lys Tyr Asp Val Leu Phe Ala Leu Phe Ser Lys 
95 100 105 110 

TTG GAA AGG ACA TGT GAA CTT ATA TAT TTG ACA CAA CCC AGC AGT TCG 384 
Leu Glu Arg Thr Cys Glu Leu lie Tyr Leu Thr Gin Pro Ser Ser Ser 
115 120 125 

ATA TCT ACT GAA ATA AAT TCT GCA TTG GTG CTA AAA GTT TCT TGG ATC 432 
lie Ser Thr Glu lie Asn Ser Ala Leu Val Leu Lys Val Ser Trp lie 
130 135 140 

ACA TTT TTA TTA GCT AAA GGG GAA GTA TTA CAA ATG GAA GAT GAT CTG 480 
Thr Phe Leu Leu Ala Lys Gly Glu Val Leu Gin Met Glu Asp Asp Leu 
145 150 155 

GTG ATT TCA TTT CAG TTA ATG CTA TGT GTC CTT GAC TAT TTT ATT AAA 528 
Val lie Ser Phe Gin Leu Met Leu Cys Val Leu Asp Tyr Phe lie Lys 
160 165 170 

CTC TCA CCT CCC ATG TTG CTC AAA GAA CCA TAT AAA ACA GCT GTT ATA 576 
Leu Ser Pro Pro Met Leu Leu Lys Glu Pro Tyr Lys Thr Ala Val lie 
175 180 185 190 

CCC ATT AAT GGT TCA CCT CGA ACA CCC AGG CGA GGT CAG AAC AGG AGT 624 
Pro lie Asn Gly Ser Pro Arg Thr Pro Arg Arg Gly Gin Asn Arg Ser 
195 200 205 



GCA CGG ATA GCA AAA CAA CTA GAA AAT GAT AC A AGA ATT ATT GAA GTT 672 
Ala Arg lie Ala Lys Gin Leu Glu Asn Asp Thr Arg lie lie Glu Val 
210 215 220 

CTC TGT AAA GAA CAT GAA TGT AAT ATA GAT GAG GTG AAA AAT GTT TAT 720 
Leu Cys Lys Glu His Glu Cys Asn lie Asp Glu Val Lys Asn Val Tyr 
225 230 235 

TTC AAA AAT TTT ATA CCT TTT ATG AAT TCT CTT GGA CTT GTA ACA TCT 768 
Phe Lys Asn Phe He Pro Phe Met Asn Ser Leu Gly Leu Val Thr Ser 
240 245 250 

AAT GGA CTT CCA GAG GTT GAA AAT CTT TCT AAA CGA TAC GAA GAA ATT 816 
Asn Gly Leu Pro Glu Val Glu Asn Leu Ser Lys Arg Tyr Glu Glu He 
255 260 265 270 

TAT CTT AAA AAT AAA GAT CTA GAT GCA AGA TTA TTT TTG GAT CAT GAT 864 
Tyr Leu Lys Asn Lys Asp Leu Asp Ala Arg Leu Phe Leu Asp His Asp 
275 280 285 

AAA ACT CTT CAG ACT GAT TCT ATA GAC AGT TTT GAA ACA CAG AGA ACA 912 
Lys Thr Leu Gin Thr Asp Ser He Asp Ser Phe Glu Thr Gin Arg Thr 
290 295 300 

CCA CGA AAA AGT AAC CTT GAT GAA GAG GTG AAT GTA ATT CCT CCA CAC 96 0 

Pro Arg Lys Ser Asn Leu Asp Glu Glu Val Asn Val He Pro Pro His 
305 310 315 

ACT CCA GTT AGG ACT GTT ATG AAC ACT ATC CAA CAA TTA ATG ATG ATT 1008 
Thr Pro Val Arg Thr Val Met Asn Thr He Gin Gin Leu Met Met He 
320 325 330 

TTA AAT TCA GCA AGT GAT CAA CCT TCA GAA AAT CTG ATT TCC TAT TTT 1056 
Leu Asn Ser Ala Ser Asp Gin Pro Ser Glu Asn Leu He Ser Tyr Phe 
335 340 345 350 

AAC AAC TGC ACA GTG AAT CCA AAA GAA AGT ATA CTG AAA AGA GTG AAG 1104 
Asn Asn Cys Thr Val Asn Pro Lys Glu Ser He Leu Lys Arg Val Lys 
355 360 365 

GAT ATA GGA TAC ATC TTT AAA GAG AAA TTT GCT AAA GCT GTG GGA CAG 1152 
Asp He Gly Tyr He Phe Lys Glu Lys Phe Ala Lys Ala Val Gly Gin 
370 375 380 

GGT TGT GTC GAA ATT GGA TCA CAG CGA TAC AAA CTT GGA GTT CGC TTG 1200 
Gly Cys Val Glu He Gly Ser Gin Arg Tyr Lys Leu Gly Val Arg Leu 
385 390 395 

TAT TAC CGA GTA ATG GAA TCC ATG CTT AAA TCA GAA GAA GAA CGA TTA 1248 
Tyr Tyr Arg Val Met Glu Ser Met Leu Lys Ser Glu Glu Glu Arg Leu 
400 405 410 



TCC ATT CAA AAT TTT AGC AAA CTT CTG AAT GAC AAC ATT TTT CAT ATG 12 96 

Ser lie Gin Asn Phe Ser Lys Leu Leu Asn Asp Asn lie Phe His Met 
415 420 425 430 

TCT TTA TTG GCG TGC GCT CTT GAG GTT GTA ATG GCC ACA TAT AGC AGA 1344 
Ser Leu Leu Ala Cys Ala Leu Glu Val Val Met Ala Thr Tyr Ser Arg 
435 440 445 

AGT ACA TCT CAG AAT CTT GAT TCT GGA ACA GAT TTG TCT TTC CCA TGG 13 92 

Ser Thr Ser Gin Asn Leu Asp Ser Gly Thr Asp Leu Ser Phe Pro Trp 
450 455 460 

ATT CTG AAT GTG CTT AAT TTA AAA GCC TTT GAT TTT TAC AAA GTG ATC 1440 
lie Leu Asn Val Leu Asn Leu Lys Ala Phe Asp Phe Tyr Lys Val lie 
465 470 475 

GAA AGT TTT ATC AAA GCA GAA GGC AAC TTG ACA AGA GAA ATG ATA AAA 1488 
Glu Ser Phe lie Lys Ala Glu Gly Asn Leu Thr Arg Glu Met lie Lys 
480 485 490 

CAT TTA GAA CGA TGT GAA CAT CGA ATC ATG GAA TCC CTT GCA TGG CTC 1536 
His Leu Glu Arg Cys Glu His Arg lie Met Glu Ser Leu Ala Trp Leu 
495 500 505 510 

TCA GAT TCA CCT TTA TTT GAT CTT ATT AAA CAA TCA AAG GAC CGA GAA 1584 
Ser Asp Ser Pro Leu Phe Asp Leu lie Lys Gin Ser Lys Asp Arg Glu 
515 520 525 

GGA CCA ACT GAT CAC CTT GAA TCT GCT TGT CCT CTT AAT CTT CCT CTC 1632 
Gly Pro Thr Asp His Leu Glu Ser Ala Cys Pro Leu Asn Leu Pro Leu 
530 535 540 

CAG AAT AAT CAC ACT GCA GCA GAT ATG TAT CTT TCT CCT GTA AGA TCT 1680 
Gin Asn Asn His Thr Ala Ala Asp Met Tyr Leu Ser Pro Val Arg Ser 
545 550 555 

CCA AAG AAA AAA GGT TCA ACT ACG CGT GTA AAT TCT ACT GCA AAT GCA 1728 
Pro Lys Lys Lys Gly Ser Thr Thr Arg Val Asn Ser Thr Ala Asn Ala 
560 565 570 

GAG ACA CAA GCA ACC TCA GCC TTC CAG ACC CAG AAG CCA TTG AAA TCT 1776 
Glu Thr Gin Ala Thr Ser Ala Phe Gin Thr Gin Lys Pro Leu Lys Ser 
575 580 585 590 

ACC TCT CTT TCA CTG TTT TAT AAA AAA GTG TAT CGG CTA GCC TAT CTC 1824 
Thr Ser Leu Ser Leu Phe Tyr Lys Lys Val Tyr Arg Leu Ala Tyr Leu 
595 600 605 

CGG CTA AAT ACA CTT TGT GAA CGC CTT CTG TCT GAG CAC CCA GAA TTA 1872 
Arg Leu Asn Thr Leu Cys Glu Arg Leu Leu Ser Glu His Pro Glu Leu 
610 615 620 



GAA CAT ATC ATC TGG ACC CTT TTC CAG CAC ACC CTG CAG AAT GAG TAT 1920 
Glu His lie lie Trp Thr Leu Phe Gin His Thr Leu Gin Asn Glu Tyr 
625 630 635 

GAA CTC ATG AGA GAC AGG CAT TTG GAC CAA ATT ATG ATG TGT TCC ATG 1968 
Glu Leu Met Arg Asp Arg His Leu Asp Gin lie Met Met Cys Ser Met 
640 645 650 

TAT GGC ATA TGC AAA GTG AAG AAT ATA GAC CTT AAA TTC AAA ATC ATT 2 016 

Tyr Gly He Cys Lys Val Lys Asn He Asp Leu Lys Phe Lys He He 
655 660 665 670 

GTA ACA GCA TAC AAG GAT CTT CCT CAT GCT GTT CAG GAG ACA TTC AAA 2064 
Val Thr Ala Tyr Lys Asp Leu Pro His Ala Val Gin Glu Thr Phe Lys 
675 680 685 

CGT GTT TTG ATC AAA GAA GAG GAG TAT GAT TCT ATT ATA GTA TTC TAT 2112 
Arg Val Leu He Lys Glu Glu Glu Tyr Asp Ser He He Val Phe Tyr 
690 695 700 

AAC TCG GTC TTC ATG CAG AGA CTG AAA ACA AAT ATT TTG CAG TAT GCT 2160 
Asn Ser Val Phe Met Gin Arg Leu Lys Thr Asn He Leu Gin Tyr Ala 
705 710 715 

TCC ACC AGG CCC CCT ACC TTG TCA CCA ATA CCT CAC ATT CCT CGA AGC 22 08 

Ser Thr Arg Pro Pro Thr Leu Ser Pro He Pro His He Pro Arg Ser 
720 725 730 

CCT TAC AAG TTT CCT AGT TCA CCC TTA CGG ATT CCT GGA GGG AAC ATC 2256 
Pro Tyr Lys Phe Pro Ser Ser Pro Leu Arg He Pro Gly Gly Asn He 
735 740 745 750 

TAT ATT TCA CCC CTG AAG AGT CCA TAT AAA ATT TCA GAA GGT CTG CCA 23 04 

Tyr He Ser Pro Leu Lys Ser Pro Tyr Lys He Ser Glu Gly Leu Pro 
755 760 765 

ACA CCA ACA AAA ATG ACT CCA AGA TCA AGA ATC TTA GTA TCA ATT GGT 2352 
Thr Pro Thr Lys Met Thr Pro Arg Ser Arg He Leu Val Ser He Gly 
770 775 780 

GAA TCA TTC GGG ACT TCT GAG AAG TTC CAG AAA ATA AAT CAG ATG GTA 2400 
Glu Ser Phe Gly Thr Ser Glu Lys Phe Gin Lys He Asn Gin Met Val 
785 790 795 

TGT AAC AGC GAC CGT GTG CTC AAA AGA AGT GCT GAA GGA AGC AAC CCT 2448 
Cys Asn Ser Asp Arg Val Leu Lys Arg Ser Ala Glu Gly Ser Asn Pro 
800 805 810 

CCT AAA CCA CTG AAA AAA CTA CGC TTT GAT ATT GAA GGA TCA GAT GAA 2496 
Pro Lys Pro Leu Lys Lys Leu Arg Phe Asp He Glu Gly Ser Asp Glu 
815 820 825 830 



GCA GAT GGA AGT AAA CAT CTC CCA GGA GAG TCC AAA TTT CAG CAG AAA 
Ala Asp Gly Ser Lys His Leu Pro Gly Glu Ser Lys Phe Gin Gin Lys 
835 840 845 



2544 



CTG GCA GAA ATG ACT TCT ACT CGA ACA CGA ATG CAA AAG CAG AAA ATG 
Leu Ala Glu Met Thr Ser Thr Arg Thr Arg Met Gin Lys Gin Lys Met 
850 855 860 



2592 



AAT GAT AGC ATG GAT ACC TCA AAC AAG GAA GAG AAA TGAGGATCTC 
Asn Asp Ser Met Asp Thr Ser Asn Lys Glu Glu Lys 
865 870 



2638 



AGGACCTTGG TGGACACTGT GTACACCTCT GGATTCATTG TCTCTCACAG ATGTGACTGT 2698 

ATAACTTTCC CAGGTTCTGT TTATGGCCAC ATTTAATATC TTCAGCTCTT TTTGTGGATA 2758 

TAAAATGTGC AGATGCAATT GTTTGGGTGA TTCCTAAGCC ACTTGAAATG TTAGTCATTG 2818 

TTATTTATAC AAGATTGAAA ATCTTGTGTA AATCCTGCCA TTTAAAAAGT TGTAGCAGAT 2878 

TGTTTCCTCT TCCAAAGTAA AATTGCTGTG CTTTATGGAT AGTAAGAATG GCCCTAGAGT 2938 

GGGAGTCCTG AT AAC C C AGG CCTGTCTGAC TACTTTGCCT TCTTTTGTAG CATATAGGTG 2998 

ATGTTTGCTC TTGTTTTTAT TAATTTATAT GTATATTTTT TTAATTTAAC ATGAACACCC 3058 

TTAGAAAATG TGTCCTATCT ATCTTCCAAA TGCAATTTGA TTGACTGCCC ATTCACCAAA 3118 

ATTATCCTGA ACTCTTCTGC AAAAATGGAT ATTATTAGAA ATTAGAAAAA AATTACTAAT 3178 

TTTACACATT AGATTTTATT TTACTATTGG AATCTGATAT ACTGTGTGCT TGTTTTATAA 323 8 

AATTTTGCTT TTAATTAAAT AAAAGCTGGA AGCAAAGTAT AAC C ATATGA TACTATCATA 3298 

CTACTGAAAC AGATTTCATA CCTCAGAATG TAAAAGAACT TACTGATTAT TTTCTTCATC 3358 

CAACTTATGT TTTTAAATGA GGATTATTGA TAGT 3392 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 874 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) 



MOLECULE TYPE: protein 



(xi) 



SEQUENCE DESCRIPTION: SEQ ID NO: 31: 



Met Asp Phe Thr Ala Leu Cys Gin Lys Leu Lys lie Pro Asp His Val 
15 10 15 



Arg Glu Arg Ala Trp Leu Thr Trp Glu Lys Val Ser Ser Val Asp Gly 
20 25 30 



Val Leu Gly Gly 
35 

He Phe He Ala 
50 

Glu Leu Gin Lys 
65 

Leu Lys Glu He 



Leu Leu Lys Lys 
100 

Arg Thr Cys Glu 
115 

Thr Glu He Asn 
130 

Leu Leu Ala Lys 
145 

Ser Phe Gin Leu 



Pro Pro Met Leu 
180 

Asn Gly Ser Pro 
195 

He Ala Lys Gin 
210 

Lys Glu His Glu 
225 

Asn Phe He Pro 



Leu Pro Glu Val 
260 

Lys Asn Lys Asp 
275 



Tyr He Gin Lys 
40 

Ala Val Asp Leu 
55 

Asn He Glu He 
70 

Asp Thr Ser Thr 
85 

Tyr Asp Val Leu 



Leu He Tyr Leu 
120 

Ser Ala Leu Val 
135 

Gly Glu Val Leu 
150 

Met Leu Cys Val 
165 

Leu Lys Glu Pro 



Arg Thr Pro Arg 
200 

Leu Glu Asn Asp 
215 

Cys Asn He Asp 
230 

Phe Met Asn Ser 
245 

Glu Asn Leu Ser 



Leu Asp Ala Arg 
280 



Lys Lys Glu Leu 



Asp Glu Met Ser 
60 

Ser Val His Lys 
75 

Lys Val Asp Asn 
90 

Phe Ala Leu Phe 
105 

Thr Gin Pro Ser 



Leu Lys Val Ser 
140 

Gin Met Glu Asp 
155 

Leu Asp Tyr Phe 
170 

Tyr Lys Thr Ala 
185 

Arg Gly Gin Asn 



Thr Arg He He 
220 

Glu Val Lys Asn 
235 

Leu Gly Leu Val 
250 

Lys Arg Tyr Glu 
265 

Leu Phe Leu Asp 



Trp Gly He Cys 
45 

Phe Thr Phe Thr 



Phe Phe Asn Leu 
80 

Ala Met Ser Arg 
95 

Ser Lys Leu Glu 
110 

Ser Ser He Ser 
125 

Trp He Thr Phe 



Asp Leu Val He 
160 

He Lys Leu Ser 
175 

Val He Pro He 
190 

Arg Ser Ala Arg 
205 

Glu Val Leu Cys 



Val Tyr Phe Lys 
240 

Thr Ser Asn Gly 
255 

Glu He Tyr Leu 
270 

His Asp Lys Thr 
285 



Leu Gin Thr Asp Ser He Asp Ser Phe Glu Thr Gin Arg Thr Pro Arg 
290 295 300 



Lys Ser Asn Leu 
305 

Val Arg Thr Val 



Ser Ala Ser Asp 
340 

Cys Thr Val Asn 
355 

Gly Tyr lie Phe 
370 

Val Glu He Gly 
385 

Arg Val Met Glu 



Gin Asn Phe Ser 
420 

Leu Ala Cys Ala 
435 

Ser Gin Asn Leu 
450 



Asp Glu Glu Val 
310 

Met Asn Thr He 
325 

Gin Pro Ser Glu 



Pro Lys Glu Ser 
360 

Lys Glu Lys Phe 
375 

Ser Gin Arg Tyr 
390 

Ser Met Leu Lys 
405 

Lys Leu Leu Asn 



Leu Glu Val Val 
440 

Asp Ser Gly Thr 
455 



Asn Val He Pro 
315 

Gin Gin Leu Met 
330 

Asn Leu He Ser 
345 

He Leu Lys Arg 



Ala Lys Ala Val 
380 

Lys Leu Gly Val 
395 

Ser Glu Glu Glu 
410 

Asp Asn He Phe 
425 

Met Ala Thr Tyr 



Asp Leu Ser Phe 
460 



Pro His Thr Pro 
320 

Met He Leu Asn 
335 

Tyr Phe Asn Asn 
350 

Val Lys Asp He 
365 

Gly Gin Gly Cys 



Arg Leu Tyr Tyr 
400 

Arg Leu Ser He 
415 

His Met Ser Leu 
430 

Ser Arg Ser Thr 
445 

Pro Trp He Leu 



Asn Val Leu Asn Leu Lys Ala Phe Asp Phe Tyr Lys Val He Glu Ser 
465 470 475 480 

Phe He Lys Ala Glu Gly Asn Leu Thr Arg Glu Met He Lys His Leu 
485 490 495 

Glu Arg Cys Glu His Arg He Met Glu Ser Leu Ala Trp Leu Ser Asp 
500 505 510 



Ser Pro Leu Phe Asp Leu He Lys 
515 520 

Thr Asp His Leu Glu Ser Ala Cys 
530 535 

Asn His Thr Ala Ala Asp Met Tyr 
545 550 

Lys Lys Gly Ser Thr Thr Arg Val 
565 

Gin Ala Thr Ser Ala Phe Gin Thr 
580 



Gin Ser Lys Asp Arg Glu Gly Pro 
525 

Pro Leu Asn Leu Pro Leu Gin Asn 
540 

Leu Ser Pro Val Arg Ser Pro Lys 
555 560 

Asn Ser Thr Ala Asn Ala Glu Thr 
570 575 

Gin Lys Pro Leu Lys Ser Thr Ser 
585 590 



Leu Ser Leu Phe Tyr Lys Lys Val Tyr Arg Leu Ala Tyr Leu Arg Leu 
595 600 605 



Asn Thr Leu Cys Glu 
610 

lie lie Trp Thr Leu 
625 

Met Arg Asp Arg His 
645 

lie Cys Lys Val Lys 
660 

Ala Tyr Lys Asp Leu 
675 

Leu lie Lys Glu Glu 
690 

Val Phe Met Gin Arg 
705 

Arg Pro Pro Thr Leu 
725 

Lys Phe Pro Ser Ser 
74 0 

Ser Pro Leu Lys Ser 
755 

Thr Lys Met Thr Pro 
770 

Phe Gly Thr Ser Glu 
785 

Ser Asp Arg Val Leu 
805 

Pro Leu Lys Lys Leu 
820 

Gly Ser Lys His Leu 
835 

Glu Met Thr Ser Thr 
850 

Ser Met Asp Thr Ser 
865 



Arg Leu Leu Ser Glu His 
615 

Phe Gin His Thr Leu Gin 
630 635 

Leu Asp Gin lie Met Met 
650 

Asn lie Asp Leu Lys Phe 
665 

Pro His Ala Val Gin Glu 
680 

Glu Tyr Asp Ser lie lie 
695 

Leu Lys Thr Asn lie Leu 
710 715 

Ser Pro lie Pro His lie 
730 

Pro Leu Arg lie Pro Gly 
745 

Pro Tyr Lys lie Ser Glu 
760 

Arg Ser Arg lie Leu Val 
775 

Lys Phe Gin Lys lie Asn 
790 795 

Lys Arg Ser Ala Glu Gly 
810 

Arg Phe Asp lie Glu Gly 
825 

Pro Gly Glu Ser Lys Phe 
840 

Arg Thr Arg Met Gin Lys 
855 

Asn Lys Glu Glu Lys 
870 



Pro Glu Leu Glu His 
620 

Asn Glu Tyr Glu Leu 
640 

Cys Ser Met Tyr Gly 
655 

Lys He He Val Thr 
670 

Thr Phe Lys Arg Val 
685 

Val Phe Tyr Asn Ser 
700 

Gin Tyr Ala Ser Thr 
720 

Pro Arg Ser Pro Tyr 
735 

Gly Asn He Tyr He 
750 

Gly Leu Pro Thr Pro 
765 

Ser He Gly Glu Ser 
780 

Gin Met Val Cys Asn 
800 

Ser Asn Pro Pro Lys 
815 

Ser Asp Glu Ala Asp 
830 

Gin Gin Lys Leu Ala 
845 

Gin Lys Met Asn Asp 
860 



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3323 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ix) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 7.. 2559 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 

GCCATC ATG GAG AAA GTT TCA TCT GTG GAT GGA GTA TTG GGA GGT TAT 48 
Met Glu Lys Val Ser Ser Val Asp Gly Val Leu Gly Gly Tyr 



1 



5 



10 



ATT CAA AAG AAA AAG GAA CTG TGG GGA ATC TGT ATC TTT ATT GCA GCA 
lie Gin Lys Lys Lys Glu Leu Trp Gly lie Cys lie Phe lie Ala Ala 
15 20 25 30 



96 



GTT GAC CTA GAT GAG ATG TCG TTC ACT TTT ACT GAG CTA CAG AAA AAC 
Val Asp Leu Asp Glu Met Ser Phe Thr Phe Thr Glu Leu Gin Lys Asn 
35 40 45 



144 



ATA GAA ATC AGT GTC CAT AAA TTC TTT AAC TTA CTA AAA GAA ATT GAT 
lie Glu lie Ser Val His Lys Phe Phe Asn Leu Leu Lys Glu lie Asp 
50 55 60 



192 



ACC AGT ACC AAA GTT GAT AAT GCT ATG TCA AGA CTG TTG AAG AAG TAT 
Thr Ser Thr Lys Val Asp Asn Ala Met Ser Arg Leu Leu Lys Lys Tyr 
65 70 75 



240 



GAT GTA TTG TTT GCA CTC TTC AGC AAA TTG GAA AGG ACA TGT GAA CTT 
Asp Val Leu Phe Ala Leu Phe Ser Lys Leu Glu Arg Thr Cys Glu Leu 
80 85 90 



288 



ATA TAT TTG ACA CAA CCC AGC AGT TCG ATA TCT ACT GAA ATA AAT TCT 
lie Tyr Leu Thr Gin Pro Ser Ser Ser lie Ser Thr Glu He Asn Ser 
95 100 105 110 



336 



GCA TTG GTG CTA AAA GTT TCT TGG ATC ACA TTT TTA TTA GCT AAA GGG 
Ala Leu Val Leu Lys Val Ser Trp He Thr Phe Leu Leu Ala Lys Gly 
115 120 125 



384 



GAA GTA TTA CAA ATG GAA GAT GAT CTG GTG ATT TCA TTT CAG TTA ATG 
Glu Val Leu Gin Met Glu Asp Asp Leu Val He Ser Phe Gin Leu Met 
130 135 140 



432 



CTA TGT GTC CTT GAC TAT TTT ATT AAA CTC TCA CCT CCC ATG TTG CTC 
Leu Cys Val Leu Asp Tyr Phe He Lys Leu Ser Pro Pro Met Leu Leu 
145 150 155 



480 



AAA GAA CCA TAT AAA ACA GCT GTT ATA CCC ATT AAT GGT TCA CCT CGA 528 
Lys Glu Pro Tyr Lys Thr Ala Val lie Pro lie Asn Gly Ser Pro Arg 
160 165 170 

ACA CCC AGG CGA GGT CAG AAC AGG AGT GCA CGG ATA GCA AAA CAA CTA 576 
Thr Pro Arg Arg Gly Gin Asn Arg Ser Ala Arg lie Ala Lys Gin Leu 
175 180 185 190 

GAA AAT GAT ACA AGA ATT ATT GAA GTT CTC TGT AAA GAA CAT GAA TGT 624 
Glu Asn Asp Thr Arg lie lie Glu Val Leu Cys Lys Glu His Glu Cys 
195 200 205 

AAT ATA GAT GAG GTG AAA AAT GTT TAT TTC AAA AAT TTT ATA CCT TTT 672 
Asn lie Asp Glu Val Lys Asn Val Tyr Phe Lys Asn Phe lie Pro Phe 
210 215 220 

ATG AAT TCT CTT GGA CTT GTA ACA TCT AAT GGA CTT CCA GAG GTT GAA 720 
Met Asn Ser Leu Gly Leu Val Thr Ser Asn Gly Leu Pro Glu Val Glu 
225 230 235 

AAT CTT TCT AAA CGA TAC GAA GAA ATT TAT CTT AAA AAT AAA GAT CTA 768 
Asn Leu Ser Lys Arg Tyr Glu Glu lie Tyr Leu Lys Asn Lys Asp Leu 
240 245 250 

GAT GCA AGA TTA TTT TTG GAT CAT GAT AAA ACT CTT CAG ACT GAT TCT 816 
Asp Ala Arg Leu Phe Leu Asp His Asp Lys Thr Leu Gin Thr Asp Ser 
255 260 265 270 

ATA GAC AGT TTT GAA ACA CAG AGA ACA CCA CGA AAA AGT AAC CTT GAT 864 
lie Asp Ser Phe Glu Thr Gin Arg Thr Pro Arg Lys Ser Asn Leu Asp 
275 280 285 

GAA GAG GTG AAT GTA ATT CCT CCA CAC ACT CCA GTT AGG ACT GTT ATG 912 
Glu Glu Val Asn Val lie Pro Pro His Thr Pro Val Arg Thr Val Met 
290 295 300 

AAC ACT ATC CAA CAA TTA ATG ATG ATT TTA AAT TCA GCA AGT GAT CAA 960 
Asn Thr lie Gin Gin Leu Met Met lie Leu Asn Ser Ala Ser Asp Gin 
305 310 315 

CCT TCA GAA AAT CTG ATT TCC TAT TTT AAC AAC TGC ACA GTG AAT CCA 1008 
Pro Ser Glu Asn Leu lie Ser Tyr Phe Asn Asn Cys Thr Val Asn Pro 
320 325 330 

AAA GAA AGT ATA CTG AAA AGA GTG AAG GAT ATA GGA TAC ATC TTT AAA 1056 
Lys Glu Ser lie Leu Lys Arg Val Lys Asp lie Gly Tyr lie Phe Lys 
335 340 345 350 

GAG AAA TTT GCT AAA GCT GTG GGA CAG GGT TGT GTC GAA ATT GGA TCA 1104 
Glu Lys Phe Ala Lys Ala Val Gly Gin Gly Cys Val Glu He Gly Ser 
355 360 365 



CAG CGA TAC AAA CTT GGA GTT CGC TTG TAT TAC CGA GTA ATG GAA TCC 1152 
Gin Arg Tyr Lys Leu Gly Val Arg Leu Tyr Tyr Arg Val Met Glu Ser 
370 375 380 

ATG CTT AAA TCA GAA GAA GAA CGA TTA TCC ATT CAA AAT TTT AGC AAA 1200 
Met Leu Lys Ser Glu Glu Glu Arg Leu Ser He Gin Asn Phe Ser Lys 
385 390 395 

CTT CTG AAT GAC AAC ATT TTT CAT ATG TCT TTA TTG GCG TGC GCT CTT 1248 
Leu Leu Asn Asp Asn He Phe His Met Ser Leu Leu Ala Cys Ala Leu 
400 405 410 

GAG GTT GTA ATG GCC ACA TAT AGC AGA AGT ACA TCT CAG AAT CTT GAT 1296 
Glu Val Val Met Ala Thr Tyr Ser Arg Ser Thr Ser Gin Asn Leu Asp 
415 420 425 430 

TCT GGA ACA GAT TTG TCT TTC CCA TGG ATT CTG AAT GTG CTT AAT TTA 1344 
Ser Gly Thr Asp Leu Ser Phe Pro Trp He Leu Asn Val Leu Asn Leu 
435 440 445 

AAA GCC TTT GAT TTT TAC AAA GTG ATC GAA AGT TTT ATC AAA GCA GAA 1392 
Lys Ala Phe Asp Phe Tyr Lys Val He Glu Ser Phe He Lys Ala Glu 
450 455 460 

GGC AAC TTG ACA AGA GAA ATG ATA AAA CAT TTA GAA CGA TGT GAA CAT 1440 
Gly Asn Leu Thr Arg Glu Met He Lys His Leu Glu Arg Cys Glu His 
465 470 475 

CGA ATC ATG GAA TCC CTT GCA TGG CTC TCA GAT TCA CCT TTA TTT GAT 1488 
Arg He Met Glu Ser Leu Ala Trp Leu Ser Asp Ser Pro Leu Phe Asp 
480 485 490 

CTT ATT AAA CAA TCA AAG GAC CGA GAA GGA CCA ACT GAT CAC CTT GAA 1536 
Leu He Lys Gin Ser Lys Asp Arg Glu Gly Pro Thr Asp His Leu Glu 
495 500 505 510 

TCT GCT TGT CCT CTT AAT CTT CCT CTC CAG AAT AAT CAC ACT GCA GCA 1584 
Ser Ala Cys Pro Leu Asn Leu Pro Leu Gin Asn Asn His Thr Ala Ala 
515 520 525 

GAT ATG TAT CTT TCT CCT GTA AGA TCT CCA AAG AAA AAA GGT TCA ACT 1632 
Asp Met Tyr Leu Ser Pro Val Arg Ser Pro Lys Lys Lys Gly Ser Thr 
530 535 540 

ACG CGT GTA AAT TCT ACT GCA AAT GCA GAG ACA CAA GCA ACC TCA GCC 16 80 

Thr Arg Val Asn Ser Thr Ala Asn Ala Glu Thr Gin Ala Thr Ser Ala 
545 550 555 

TTC CAG ACC CAG AAG CCA TTG AAA TCT ACC TCT CTT TCA CTG TTT TAT 1728 
Phe Gin Thr Gin Lys Pro Leu Lys Ser Thr Ser Leu Ser Leu Phe Tyr 
560 565 570 



AAA AAA GTG TAT CGG CTA GCC TAT CTC CGG CTA AAT ACA CTT TGT GAA 1776 
Lys Lys Val Tyr Arg Leu Ala Tyr Leu Arg Leu Asn Thr Leu Cys Glu 
575 580 585 590 

CGC CTT CTG TCT GAG CAC CCA GAA TTA GAA CAT ATC ATC TGG ACC CTT 1824 
Arg Leu Leu Ser Glu His Pro Glu Leu Glu His He He Trp Thr Leu 
595 600 605 

TTC CAG CAC ACC CTG CAG AAT GAG TAT GAA CTC ATG AGA GAC AGG CAT 1872 
Phe Gin His Thr Leu Gin Asn Glu Tyr Glu Leu Met Arg Asp Arg His 
610 615 620 

TTG GAC CAA ATT ATG ATG TGT TCC ATG TAT GGC ATA TGC AAA GTG AAG 192 0 

Leu Asp Gin He Met Met Cys Ser Met Tyr Gly He Cys Lys Val Lys 
625 630 635 

AAT ATA GAC CTT AAA TTC AAA ATC ATT GTA ACA GCA TAC AAG GAT CTT 1968 
Asn He Asp Leu Lys Phe Lys He He Val Thr Ala Tyr Lys Asp Leu 
640 645 650 

CCT CAT GCT GTT CAG GAG ACA TTC AAA CGT GTT TTG ATC AAA GAA GAG 2016 
Pro His Ala Val Gin Glu Thr Phe Lys Arg Val Leu He Lys Glu Glu 
655 660 665 670 

GAG TAT GAT TCT ATT ATA GTA TTC TAT AAC TCG GTC TTC ATG CAG AGA 2064 
Glu Tyr Asp Ser He He Val Phe Tyr Asn Ser Val Phe Met Gin Arg 
675 680 685 

CTG AAA ACA AAT ATT TTG CAG TAT GCT TCC ACC AGG CCC CCT ACC TTG 2112 
Leu Lys Thr Asn He Leu Gin Tyr Ala Ser Thr Arg Pro Pro Thr Leu 
690 695 700 

TCA CCA ATA CCT CAC ATT CCT CGA AGC CCT TAC AAG TTT CCT AGT TCA 2160 
Ser Pro He Pro His He Pro Arg Ser Pro Tyr Lys Phe Pro Ser Ser 
705 710 715 

CCC TTA CGG ATT CCT GGA GGG AAC ATC TAT ATT TCA CCC CTG AAG AGT 2208 
Pro Leu Arg He Pro Gly Gly Asn He Tyr He Ser Pro Leu Lys Ser 
720 725 730 

CCA TAT AAA ATT TCA GAA GGT CTG CCA ACA CCA ACA AAA ATG ACT CCA 2256 
Pro Tyr Lys He Ser Glu Gly Leu Pro Thr Pro Thr Lys Met Thr Pro 
735 740 745 750 

AGA TCA AGA ATC TTA GTA TCA ATT GGT GAA TCA TTC GGG ACT TCT GAG 23 04 

Arg Ser Arg He Leu Val Ser He Gly Glu Ser Phe Gly Thr Ser Glu 
755 760 765 

AAG TTC CAG AAA ATA AAT CAG ATG GTA TGT AAC AGC GAC CGT GTG CTC 2352 
Lys Phe Gin Lys He Asn Gin Met Val Cys Asn Ser Asp Arg Val Leu 
770 775 780 



AAA AGA AGT GCT GAA GGA AGC AAC CCT CCT AAA CCA CTG AAA AAA CTA 
Lys Arg Ser Ala Glu Gly Ser Asn Pro Pro Lys Pro Leu Lys Lys Leu 
785 790 795 



2400 



CGC TTT GAT ATT GAA GGA TCA GAT GAA GCA GAT GGA AGT AAA CAT CTC 
Arg Phe Asp lie Glu Gly Ser Asp Glu Ala Asp Gly Ser Lys His Leu 
800 805 810 



2448 



CCA GGA GAG TCC AAA TTT CAG CAG AAA CTG GCA GAA ATG ACT TCT ACT 
Pro Gly Glu Ser Lys Phe Gin Gin Lys Leu Ala Glu Met Thr Ser Thr 
815 820 825 830 



2496 



CGA ACA CGA ATG CAA AAG CAG AAA ATG AAT GAT AGC ATG GAT ACC TCA 
Arg Thr Arg Met Gin Lys Gin Lys Met Asn Asp Ser Met Asp Thr Ser 
835 840 845 



2544 



AAC AAG GAA GAG AAA TGAGGATCTC AGGACCTTGG TGGACACTGT GTACACCTCT 



2599 



Asn Lys Glu Glu Lys 
850 

GGATTCATTG TCTCTCACAG ATGTGACTGT ATAACTTTCC CAGGTTCTGT TTATGGCCAC 2659 

ATTTAATATC TTCAGCTCTT TTTGTGGATA TAAAATGTGC AGATGCAATT GTTTGGGTGA 2719 

TTCCTAAGCC ACTTGAAATG TTAGTCATTG TTATTTATAC AAGATTGAAA ATCTTGTGTA 2 779 

AATCCTGCCA TTTAAAAAGT TGTAGCAGAT TGTTTCCTCT TCCAAAGTAA AATTGCTGTG 283 9 

CTTTATGGAT AGTAAGAATG GCC CTAGAGT GGGAGTCCTG ATAACCCAGG CCTGTCTGAC 2899 

TACTTTGCCT TCTTTTGTAG CATATAGGTG ATGTTTGCTC TTGTTTTTAT TAATTTATAT 2959 

GTATATTTTT TTAATTTAAC ATGAACACCC TTAGAAAATG TGTCCTATCT ATCTTCCAAA 3019 

TGCAATTTGA TTGACTGCCC ATTCACCAAA ATTATCCTGA ACTCTTCTGC AAAAATGGAT 3 079 

ATTATTAGAA ATTAGAAAAA AATTACTAAT TTTACACATT AGATTTTATT TTACTATTGG 3139 

AATCTGATAT ACTGTGTGCT TGTTTTATAA AATTTTGCTT TTAATTAAAT AAAAGCTGGA 3199 

AGCAAAGTAT AACCATATGA TACTATCATA CTACTGAAAC AGATTTCATA CCTCAGAATG 3259 

TAAAAGAACT TACTGATTAT TTTCTTCATC CAACTTATGT TTTTAAATGA GGATTATTGA 3319 

TAGT 3323 



(2) INFORMATION FOR SEQ ID NO: 33: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 851 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

Met Glu Lys Val Ser Ser Val Asp Gly Val Leu Gly Gly Tyr lie Gin 
15 10 15 

Lys Lys Lys Glu Leu Trp Gly lie Cys lie Phe lie Ala Ala Val Asp 
20 25 30 

Leu Asp Glu Met Ser Phe Thr Phe Thr Glu Leu Gin Lys Asn lie Glu 
35 40 45 

lie Ser Val His Lys Phe Phe Asn Leu Leu Lys Glu lie Asp Thr Ser 
50 55 60 

Thr Lys Val Asp Asn Ala Met Ser Arg Leu Leu Lys Lys Tyr Asp Val 
65 70 75 80 

Leu Phe Ala Leu Phe Ser Lys Leu Glu Arg Thr Cys Glu Leu lie Tyr 
85 90 95 

Leu Thr Gin Pro Ser Ser Ser lie Ser Thr Glu lie Asn Ser Ala Leu 
100 105 110 

Val Leu Lys Val Ser Trp lie Thr Phe Leu Leu Ala Lys Gly Glu Val 
115 120 125 

Leu Gin Met Glu Asp Asp Leu Val He Ser Phe Gin Leu Met Leu Cys 
130 135 140 

Val Leu Asp Tyr Phe He Lys Leu Ser Pro Pro Met Leu Leu Lys Glu 
145 150 155 160 

Pro Tyr Lys Thr Ala Val He Pro He Asn Gly Ser Pro Arg Thr Pro 
165 170 175 

Arg Arg Gly Gin Asn Arg Ser Ala Arg He Ala Lys Gin Leu Glu Asn 
180 185 190 

Asp Thr Arg He He Glu Val Leu Cys Lys Glu His Glu Cys Asn He 
195 200 205 



Asp Glu Val Lys Asn Val Tyr Phe Lys Asn Phe He Pro Phe Met Asn 
210 215 220 



Ser Leu Gly Leu Val Thr Ser Asn Gly Leu Pro Glu Val Glu Asn Leu 
225 230 235 240 



Ser Lys Arg Tyr Glu Glu lie Tyr Leu Lys Asn Lys Asp Leu Asp Ala 
245 250 255 

Arg Leu Phe Leu Asp His Asp Lys Thr Leu Gin Thr Asp Ser lie Asp 
260 265 270 

Ser Phe Glu Thr Gin Arg Thr Pro Arg Lys Ser Asn Leu Asp Glu Glu 
275 280 285 

Val Asn Val lie Pro Pro His Thr Pro Val Arg Thr Val Met Asn Thr 
290 295 300 

lie Gin Gin Leu Met Met lie Leu Asn Ser Ala Ser Asp Gin Pro Ser 
305 310 315 320 

Glu Asn Leu lie Ser Tyr Phe Asn Asn Cys Thr Val Asn Pro Lys Glu 
325 330 335 

Ser lie Leu Lys Arg Val Lys Asp lie Gly Tyr lie Phe Lys Glu Lys 
340 345 350 

Phe Ala Lys Ala Val Gly Gin Gly Cys Val Glu He Gly Ser Gin Arg 
355 360 365 

Tyr Lys Leu Gly Val Arg Leu Tyr Tyr Arg Val Met Glu Ser Met Leu 
370 375 380 

Lys Ser Glu Glu Glu Arg Leu Ser He Gin Asn Phe Ser Lys Leu Leu 
385 390 395 400 

Asn Asp Asn He Phe His Met Ser Leu Leu Ala Cys Ala Leu Glu Val 
405 410 415 

Val Met Ala Thr Tyr Ser Arg Ser Thr Ser Gin Asn Leu Asp Ser Gly 
420 425 430 

Thr Asp Leu Ser Phe Pro Trp He Leu Asn Val Leu Asn Leu Lys Ala 
435 440 445 

Phe Asp Phe Tyr Lys Val He Glu Ser Phe He Lys Ala Glu Gly Asn 
450 455 460 

Leu Thr Arg Glu Met He Lys His Leu Glu Arg Cys Glu His Arg He 
465 470 475 480 

Met Glu Ser Leu Ala Trp Leu Ser Asp Ser Pro Leu Phe Asp Leu He 
485 490 495 



Lys Gin Ser Lys Asp Arg Glu Gly Pro Thr Asp His Leu Glu Ser Ala 
500 505 510 



Cys Pro Leu Asn Leu Pro Leu Gin Asn Asia His Thr Ala Ala Asp Met 
515 520 525 



Tyr Leu Ser Pro Val 
530 

Val Asn Ser Thr Ala 
545 

Thr Gin Lys Pro Leu 
565 

Val Tyr Arg Leu Ala 
580 

Leu Ser Glu His Pro 
595 

His Thr Leu Gin Asn 
610 

Gin lie Met Met Cys 
625 

Asp Leu Lys Phe Lys 
645 

Ala Val Gin Glu Thr 
660 

Asp Ser lie lie Val 
675 

Thr Asn lie Leu Gin 
690 

lie Pro His He Pro 
705 

Arg He Pro Gly Gly 
725 

Lys He Ser Glu Gly 
740 

Arg He Leu Val Ser 
755 

Gin Lys He Asn Gin 
770 



Arg Ser Pro Lys Lys Lys 
535 

Asn Ala Glu Thr Gin Ala 
550 555 

Lys Ser Thr Ser Leu Ser 
570 

Tyr Leu Arg Leu Asn Thr 
585 

Glu Leu Glu His He He 
600 

Glu Tyr Glu Leu Met Arg 
615 

Ser Met Tyr Gly He Cys 
630 635 

He He Val Thr Ala Tyr 
650 

Phe Lys Arg Val Leu He 
665 

Phe Tyr Asn Ser Val Phe 
680 

Tyr Ala Ser Thr Arg Pro 
695 

Arg Ser Pro Tyr Lys Phe 
710 715 

Asn He Tyr He Ser Pro 
730 

Leu Pro Thr Pro Thr Lys 
745 

He Gly Glu Ser Phe Gly 
760 

Met Val Cys Asn Ser Asp 

775 



Gly Ser Thr Thr Arg 
540 

Thr Ser Ala Phe Gin 
560 

Leu Phe Tyr Lys Lys 
575 

Leu Cys Glu Arg Leu 
590 

Trp Thr Leu Phe Gin 
605 

Asp Arg His Leu Asp 
620 

Lys Val Lys Asn He 
640 

Lys Asp Leu Pro His 
655 

Lys Glu Glu Glu Tyr 
670 

Met Gin Arg Leu Lys 
685 

Pro Thr Leu Ser Pro 
700 

Pro Ser Ser Pro Leu 
720 

Leu Lys Ser Pro Tyr 
735 

Met Thr Pro Arg Ser 
750 

Thr Ser Glu Lys Phe 
765 

Arg Val Leu Lys Arg 
780 



Ser Ala Glu Gly Ser Asn Pro Pro Lys Pro Leu Lys Lys Leu Arg Phe 
785 790 795 800 



Asp lie Glu Gly Ser Asp Glu Ala Asp Gly Ser Lys His Leu Pro Gly 
805 810 815 



Glu Ser Lys Phe Gin Gin Lys Leu Ala Glu Met Thr Ser Thr Arg Thr 
820 825 830 



Arg Met Gin Lys Gin Lys Met Asn Asp Ser Met Asp Thr Ser Asn Lys 
835 840 845 



Glu Glu Lys 
850 



(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3266 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 7.. 2502 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
GCCATC ATG CTG TGG GGA ATC TGT ATC TTT ATT GCA GCA GTT GAC CTA 48 



Met Leu Trp Gly lie Cys lie 
1 5 



Phe lie Ala Ala Val Asp Leu 
10 



GAT GAG ATG TCG TTC ACT TTT ACT GAG CTA CAG AAA AAC ATA GAA ATC 
Asp Glu Met Ser Phe Thr Phe Thr Glu Leu Gin Lys Asn lie Glu He 
15 20 25 30 



96 



AGT GTC CAT AAA TTC TTT AAC TTA CTA AAA GAA ATT GAT ACC AGT ACC 
Ser Val His Lys Phe Phe Asn Leu Leu Lys Glu He Asp Thr Ser Thr 
35 40 45 



144 



AAA GTT GAT AAT GCT ATG TCA AGA CTG TTG AAG AAG TAT GAT GTA TTG 
Lys Val Asp Asn Ala Met Ser Arg Leu Leu Lys Lys Tyr Asp Val Leu 
50 55 60 



192 



TTT GCA CTC TTC AGC AAA TTG GAA AGG ACA TGT GAA CTT ATA TAT TTG 
Phe Ala Leu Phe Ser Lys Leu Glu Arg Thr Cys Glu Leu He Tyr Leu 
65 70 75 



240 



ACA CAA CCC AGC AGT TCG ATA TCT ACT GAA ATA AAT TCT GCA TTG GTG 
Thr Gin Pro Ser Ser Ser He Ser Thr Glu He Asn Ser Ala Leu Val 
80 85 90 



288 



CTA AAA GTT TCT TGG ATC ACA TTT TTA TTA GCT AAA GGG GAA GTA TTA 
Leu Lys Val Ser Trp He Thr Phe Leu Leu Ala Lys Gly Glu Val Leu 
95 100 105 110 



336 



CAA ATG GAA GAT GAT CTG GTG ATT TCA TTT CAG TTA ATG CTA TGT GTC 384 
Gin Met Glu Asp Asp Leu Val lie Ser Phe Gin Leu Met Leu Cys Val 
115 120 125 

CTT GAC TAT TTT ATT AAA CTC TCA CCT CCC ATG TTG CTC AAA GAA CCA 432 
Leu Asp Tyr Phe lie Lys Leu Ser Pro Pro Met Leu Leu Lys Glu Pro 
130 135 140 

TAT AAA ACA GCT GTT ATA CCC ATT AAT GGT TCA CCT CGA ACA CCC AGG 480 
Tyr Lys Thr Ala Val lie Pro lie Asn Gly Ser Pro Arg Thr Pro Arg 
145 150 155 

CGA GGT CAG AAC AGG AGT GCA CGG ATA GCA AAA CAA CTA GAA AAT GAT 528 
Arg Gly Gin Asn Arg Ser Ala Arg lie Ala Lys Gin Leu Glu Asn Asp 
160 165 170 

ACA AGA ATT ATT GAA GTT CTC TGT AAA GAA CAT GAA TGT AAT ATA GAT 576 
Thr Arg lie lie Glu Val Leu Cys Lys Glu His Glu Cys Asn lie Asp 
175 180 185 190 

GAG GTG AAA AAT GTT TAT TTC AAA AAT TTT ATA CCT TTT ATG AAT TCT 624 
Glu Val Lys Asn Val Tyr Phe Lys Asn Phe lie Pro Phe Met Asn Ser 
195 200 205 

CTT GGA CTT GTA ACA TCT AAT GGA CTT CCA GAG GTT GAA AAT CTT TCT 672 
Leu Gly Leu Val Thr Ser Asn Gly Leu Pro Glu Val Glu Asn Leu Ser 
210 215 220 

AAA CGA TAC GAA GAA ATT TAT CTT AAA AAT AAA GAT CTA GAT GCA AGA 720 
Lys Arg Tyr Glu Glu lie Tyr Leu Lys Asn Lys Asp Leu Asp Ala Arg 
225 230 235 

TTA TTT TTG GAT CAT GAT AAA ACT CTT CAG ACT GAT TCT ATA GAC AGT 768 
Leu Phe Leu Asp His Asp Lys Thr Leu Gin Thr Asp Ser lie Asp Ser 
240 245 250 

TTT GAA ACA CAG AGA ACA CCA CGA AAA AGT AAC CTT GAT GAA GAG GTG 816 
Phe Glu Thr Gin Arg Thr Pro Arg Lys Ser Asn Leu Asp Glu Glu Val 
255 260 265 270 

AAT GTA ATT CCT CCA CAC ACT CCA GTT AGG ACT GTT ATG AAC ACT ATC 864 
Asn Val lie Pro Pro His Thr Pro Val Arg Thr Val Met Asn Thr lie 
275 280 285 

CAA CAA TTA ATG ATG ATT TTA AAT TCA GCA AGT GAT CAA CCT TCA GAA 912 
Gin Gin Leu Met Met lie Leu Asn Ser Ala Ser Asp Gin Pro Ser Glu 
290 295 300 

AAT CTG ATT TCC TAT TTT AAC AAC TGC ACA GTG AAT CCA AAA GAA AGT 960 
Asn Leu lie Ser Tyr Phe Asn Asn Cys Thr Val Asn Pro Lys Glu Ser 
305 310 315 



ATA CTG AAA AGA GTG AAG GAT ATA GGA TAC ATC TTT AAA GAG AAA TTT 1008 
lie Leu Lys Arg Val Lys Asp lie Gly Tyr lie Phe Lys Glu Lys Phe 
320 325 330 

GCT AAA GCT GTG GGA CAG GGT TGT GTC GAA ATT GGA TCA CAG CGA TAC 1056 
Ala Lys Ala Val Gly Gin Gly Cys Val Glu lie Gly Ser Gin Arg Tyr 
335 340 345 350 

AAA CTT GGA GTT CGC TTG TAT TAC CGA GTA ATG GAA TCC ATG CTT AAA 1104 
Lys Leu Gly Val Arg Leu Tyr Tyr Arg Val Met Glu Ser Met Leu Lys 
355 360 365 

TCA GAA GAA GAA CGA TTA TCC ATT CAA AAT TTT AGC AAA CTT CTG AAT 1152 
Ser Glu Glu Glu Arg Leu Ser lie Gin Asn Phe Ser Lys Leu Leu Asn 
370 375 380 

GAC AAC ATT TTT CAT ATG TCT TTA TTG GCG TGC GCT CTT GAG GTT GTA 12 0 0 

Asp Asn lie Phe His Met Ser Leu Leu Ala Cys Ala Leu Glu Val Val 
385 390 395 

ATG GCC AC A TAT AGC AGA AGT ACA TCT CAG AAT CTT GAT TCT GGA ACA 1248 
Met Ala Thr Tyr Ser Arg Ser Thr Ser Gin Asn Leu Asp Ser Gly Thr 
400 405 410 

GAT TTG TCT TTC CCA TGG ATT CTG AAT GTG CTT AAT TTA AAA GCC TTT 1296 
Asp Leu Ser Phe Pro Trp lie Leu Asn Val Leu Asn Leu Lys Ala Phe 
415 420 425 430 

GAT TTT TAC AAA GTG ATC GAA AGT TTT ATC AAA GCA GAA GGC AAC TTG 1344 
Asp Phe Tyr Lys Val lie Glu Ser Phe lie Lys Ala Glu Gly Asn Leu 
435 440 445 

ACA AGA GAA ATG ATA AAA CAT TTA GAA CGA TGT GAA CAT CGA ATC ATG 13 92 

Thr Arg Glu Met lie Lys His Leu Glu Arg Cys Glu His Arg lie Met 
450 455 460 

GAA TCC CTT GCA TGG CTC TCA GAT TCA CCT TTA TTT GAT CTT ATT AAA 1440 
Glu Ser Leu Ala Trp Leu Ser Asp Ser Pro Leu Phe Asp Leu lie Lys 
465 470 475 

CAA TCA AAG GAC CGA GAA GGA CCA ACT GAT CAC CTT GAA TCT GCT TGT 1488 
Gin Ser Lys Asp Arg Glu Gly Pro Thr Asp His Leu Glu Ser Ala Cys 
480 485 490 

CCT CTT AAT CTT CCT CTC CAG AAT AAT CAC ACT GCA GCA GAT ATG TAT 1536 
Pro Leu Asn Leu Pro Leu Gin Asn Asn His Thr Ala Ala Asp Met Tyr 
495 500 505 510 

CTT TCT CCT GTA AGA TCT CCA AAG AAA AAA GGT TCA ACT ACG CGT GTA 1584 
Leu Ser Pro Val Arg Ser Pro Lys Lys Lys Gly Ser Thr Thr Arg Val 
515 520 525 



AAT TCT ACT GCA AAT GCA GAG ACA CAA GCA ACC TCA GCC TTC CAG ACC 1632 
Asn Ser Thr Ala Asn Ala Glu Thr Gin Ala Thr Ser Ala Phe Gin Thr 
530 535 540 

CAG AAG CCA TTG AAA TCT ACC TCT CTT TCA CTG TTT TAT AAA AAA GTG 1680 
Gin Lys Pro Leu Lys Ser Thr Ser Leu Ser Leu Phe Tyr Lys Lys Val 
545 550 555 

TAT CGG CTA GCC TAT CTC CGG CTA AAT ACA CTT TGT GAA CGC CTT CTG 1728 
Tyr Arg Leu Ala Tyr Leu Arg Leu Asn Thr Leu Cys Glu Arg Leu Leu 
560 565 570 

TCT GAG CAC CCA GAA TTA GAA CAT ATC ATC TGG ACC CTT TTC CAG CAC 17 76 

Ser Glu His Pro Glu Leu Glu His lie lie Trp Thr Leu Phe Gin His 
575 580 585 590 

ACC CTG CAG AAT GAG TAT GAA CTC ATG AGA GAC AGG CAT TTG GAC CAA 1824 
Thr Leu Gin Asn Glu Tyr Glu Leu Met Arg Asp Arg His Leu Asp Gin 
595 600 605 

ATT ATG ATG TGT TCC ATG TAT GGC ATA TGC AAA GTG AAG AAT ATA GAC 1872 
lie Met Met Cys Ser Met Tyr Gly lie Cys Lys Val Lys Asn lie Asp 
610 615 620 

CTT AAA TTC AAA ATC ATT GTA ACA GCA TAC AAG GAT CTT CCT CAT GCT 1920 
Leu Lys Phe Lys lie lie Val Thr Ala Tyr Lys Asp Leu Pro His Ala 
625 630 635 

GTT CAG GAG ACA TTC AAA CGT GTT TTG ATC AAA GAA GAG GAG TAT GAT 1968 
Val Gin Glu Thr Phe Lys Arg Val Leu lie Lys Glu Glu Glu Tyr Asp 
640 645 650 

TCT ATT ATA GTA TTC TAT AAC TCG GTC TTC ATG CAG AGA CTG AAA ACA 2 016 

Ser lie lie Val Phe Tyr Asn Ser Val Phe Met Gin Arg Leu Lys Thr 
655 660 665 670 

AAT ATT TTG CAG TAT GCT TCC ACC AGG CCC CCT ACC TTG TCA CCA ATA 2064 
Asn lie Leu Gin Tyr Ala Ser Thr Arg Pro Pro Thr Leu Ser Pro lie 
675 680 685 

CCT CAC ATT CCT CGA AGC CCT TAC AAG TTT CCT AGT TCA CCC TTA CGG 2112 
Pro His lie Pro Arg Ser Pro Tyr Lys Phe Pro Ser Ser Pro Leu Arg 
690 695 700 

ATT CCT GGA GGG AAC ATC TAT ATT TCA CCC CTG AAG AGT CCA TAT AAA 2160 
lie Pro Gly Gly Asn lie Tyr lie Ser Pro Leu Lys Ser Pro Tyr Lys 
705 710 715 

ATT TCA GAA GGT CTG CCA ACA CCA ACA AAA ATG ACT CCA AGA TCA AGA 22 08 

lie Ser Glu Gly Leu Pro Thr Pro Thr Lys Met Thr Pro Arg Ser Arg 
720 725 730 



ATC TTA GTA TCA ATT GGT GAA TCA TTC GGG ACT TCT GAG AAG TTC CAG 2256 
lie Leu Val Ser lie Gly Glu Ser Phe Gly Thr Ser Glu Lys Phe Gin 
735 740 745 750 

AAA ATA AAT CAG ATG GTA TGT AAC AGC GAC CGT GTG CTC AAA AGA AGT 23 04 

Lys lie Asn Gin Met Val Cys Asn Ser Asp Arg Val Leu Lys Arg Ser 
755 760 765 

GCT GAA GGA AGC AAC CCT CCT AAA CCA CTG AAA AAA CTA CGC TTT GAT 2352 
Ala Glu Gly Ser Asn Pro Pro Lys Pro Leu Lys Lys Leu Arg Phe Asp 
770 775 780 

ATT GAA GGA TCA GAT GAA GCA GAT GGA AGT AAA CAT CTC CCA GGA GAG 2400 
lie Glu Gly Ser Asp Glu Ala Asp Gly Ser Lys His Leu Pro Gly Glu 
785 790 795 

TCC AAA TTT CAG CAG AAA CTG GCA GAA ATG ACT TCT ACT CGA ACA CGA 2448 
Ser Lys Phe Gin Gin Lys Leu Ala Glu Met Thr Ser Thr Arg Thr Arg 
800 805 810 

ATG CAA AAG CAG AAA ATG AAT GAT AGC ATG GAT ACC TCA AAC AAG GAA 2496 
Met Gin Lys Gin Lys Met Asn Asp Ser Met Asp Thr Ser Asn Lys Glu 
815 820 825 830 

GAG AAA TGAGGATCTC AGGACCTTGG TGGACACTGT GTACACCTCT GGATTCATTG 2552 
Glu Lys 



TCTCTCACAG ATGTGACTGT ATAACTTTCC CAGGTTCTGT TTATGGCCAC ATTTAATATC 2612 

TTCAGCTCTT TTTGTGGATA TAAAATGTGC AGATGCAATT GTTTGGGTGA TTCCTAAGCC 2672 

ACTTGAAATG TTAGTCATTG TTATTTATAC AAGATTGAAA ATCTTGTGTA AATCCTGCCA 2 732 

TTTAAAAAGT TGTAGCAGAT TGTTTCCTCT TCCAAAGTAA AATTGCTGTG CTTTATGGAT 2792 

AGTAAGAATG GCCCTAGAGT GGGAGTCCTG ATAACCCAGG CCTGTCTGAC TACTTTGCCT 2 852 

TCTTTTGTAG CATATAGGTG ATGTTTGCTC TTGTTTTTAT TAATTTATAT GTATATTTTT 2912 

TTAATTTAAC ATGAACACCC TTAGAAAATG TGTCCTATCT ATCTTCCAAA TGCAATTTGA 2 972 

TTGACTGCCC ATTCAC C AAA ATTATCCTGA ACTCTTCTGC AAAAATGGAT ATTATTAGAA 3032 

ATTAGAAAAA AATTACTAAT TTTACACATT AGATTTTATT TTACTATTGG AATCTGATAT 3 092 

ACTGTGTGCT TGTTTTATAA AATTTTGCTT TTAATTAAAT AAAAGCTGGA AGCAAAGTAT 3152 

AAC CAT ATG A TACTATCATA CTACTGAAAC AGATTTCATA CCTCAGAATG TAAAAGAACT 3212 

TACTGATTAT TTTCTTCATC CAACTTATGT TTTTAAATGA GGATTATTGA TAGT 3266 



(2) INFORMATION FOR SEQ ID NO: 35: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 832 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

Met Leu Trp Gly lie Cys lie Phe lie Ala Ala Val Asp Leu Asp Glu 
15 10 15 

Met Ser Phe Thr Phe Thr Glu Leu Gin Lys Asn lie Glu lie Ser Val 
20 25 30 

His Lys Phe Phe Asn Leu Leu Lys Glu lie Asp Thr Ser Thr Lys Val 
35 40 45 

Asp Asn Ala Met Ser Arg Leu Leu Lys Lys Tyr Asp Val Leu Phe Ala 
50 55 60 

Leu Phe Ser Lys Leu Glu Arg Thr Cys Glu Leu lie Tyr Leu Thr Gin 
65 70 75 80 

Pro Ser Ser Ser lie Ser Thr Glu lie Asn Ser Ala Leu Val Leu Lys 
85 90 95 

Val Ser Trp lie Thr Phe Leu Leu Ala Lys Gly Glu Val Leu Gin Met 
100 105 110 

Glu Asp Asp Leu Val He Ser Phe Gin Leu Met Leu Cys Val Leu Asp 
115 120 125 

Tyr Phe He Lys Leu Ser Pro Pro Met Leu Leu Lys Glu Pro Tyr Lys 
130 135 140 

Thr Ala Val He Pro He Asn Gly Ser Pro Arg Thr Pro Arg Arg Gly 
145 150 155 160 

Gin Asn Arg Ser Ala Arg He Ala Lys Gin Leu Glu Asn Asp Thr Arg 
165 170 175 

He He Glu Val Leu Cys Lys Glu His Glu Cys Asn He Asp Glu Val 
180 185 190 

Lys Asn Val Tyr Phe Lys Asn Phe He Pro Phe Met Asn Ser Leu Gly 
195 200 205 



Leu Val Thr Ser Asn Gly Leu Pro Glu Val Glu Asn Leu Ser Lys Arg 
210 215 220 



Tyr Glu Glu lie Tyr Leu Lys Asn Lys Asp Leu Asp Ala Arg Leu Phe 
225 230 235 240 



Leu Asp His Asp Lys Thr Leu Gin Thr Asp Ser lie Asp Ser Phe Glu 
245 250 255 

Thr Gin Arg Thr Pro Arg Lys Ser Asn Leu Asp Glu Glu Val Asn Val 
260 265 270 

lie Pro Pro His Thr Pro Val Arg Thr Val Met Asn Thr lie Gin Gin 
275 280 285 

Leu Met Met lie Leu Asn Ser Ala Ser Asp Gin Pro Ser Glu Asn Leu 
290 295 300 

lie Ser Tyr Phe Asn Asn Cys Thr Val Asn Pro Lys Glu Ser lie Leu 
305 310 315 320 

Lys Arg Val Lys Asp lie Gly Tyr lie Phe Lys Glu Lys Phe Ala Lys 
325 330 335 

Ala Val Gly Gin Gly Cys Val Glu lie Gly Ser Gin Arg Tyr Lys Leu 
340 345 350 

Gly Val Arg Leu Tyr Tyr Arg Val Met Glu Ser Met Leu Lys Ser Glu 
355 360 365 

Glu Glu Arg Leu Ser lie Gin Asn Phe Ser Lys Leu Leu Asn Asp Asn 
370 375 380 

lie Phe His Met Ser Leu Leu Ala Cys Ala Leu Glu Val Val Met Ala 
385 390 395 400 

Thr Tyr Ser Arg Ser Thr Ser Gin Asn Leu Asp Ser Gly Thr Asp Leu 
405 410 415 

Ser Phe Pro Trp lie Leu Asn Val Leu Asn Leu Lys Ala Phe Asp Phe 
420 425 430 

Tyr Lys Val lie Glu Ser Phe lie Lys Ala Glu Gly Asn Leu Thr Arg 
435 440 445 

Glu Met lie Lys His Leu Glu Arg Cys Glu His Arg lie Met Glu Ser 
450 455 460 

Leu Ala Trp Leu Ser Asp Ser Pro Leu Phe Asp Leu lie Lys Gin Ser 
465 470 475 480 

Lys Asp Arg Glu Gly Pro Thr Asp His Leu Glu Ser Ala Cys Pro Leu 
485 490 495 



Asn Leu Pro Leu Gin Asn Asn His Thr Ala Ala Asp Met Tyr Leu Ser 
500 505 510 



Pro Val Arg Ser Pro Lys Lys Lys Gly Ser Thr Thr Arg Val Asn Ser 
515 520 525 



Thr Ala Asn Ala Glu Thr Gin Ala Thr Ser Ala Phe Gin Thr Gin Lys 
530 535 540 

Pro Leu Lys Ser Thr Ser Leu Ser Leu Phe Tyr Lys Lys Val Tyr Arg 
545 550 555 560 

Leu Ala Tyr Leu Arg Leu Asn Thr Leu Cys Glu Arg Leu Leu Ser Glu 
565 570 575 

His Pro Glu Leu Glu His lie lie Trp Thr Leu Phe Gin His Thr Leu 
580 585 590 

Gin Asn Glu Tyr Glu Leu Met Arg Asp Arg His Leu Asp Gin lie Met 
595 600 605 

Met Cys Ser Met Tyr Gly lie Cys Lys Val Lys Asn lie Asp Leu Lys 
610 615 620 

Phe Lys He He Val Thr Ala Tyr Lys Asp Leu Pro His Ala Val Gin 
625 630 635 640 

Glu Thr Phe Lys Arg Val Leu He Lys Glu Glu Glu Tyr Asp Ser He 
645 650 655 

He Val Phe Tyr Asn Ser Val Phe Met Gin Arg Leu Lys Thr Asn He 
660 665 670 

Leu Gin Tyr Ala Ser Thr Arg Pro Pro Thr Leu Ser Pro He Pro His 
675 680 685 

He Pro Arg Ser Pro Tyr Lys Phe Pro Ser Ser Pro Leu Arg He Pro 
690 695 700 

Gly Gly Asn He Tyr He Ser Pro Leu Lys Ser Pro Tyr Lys He Ser 
705 710 715 720 

Glu Gly Leu Pro Thr Pro Thr Lys Met Thr Pro Arg Ser Arg He Leu 
725 730 735 

Val Ser He Gly Glu Ser Phe Gly Thr Ser Glu Lys Phe Gin Lys He 
740 745 750 

Asn Gin Met Val Cys Asn Ser Asp Arg Val Leu Lys Arg Ser Ala Glu 
755 760 765 

Gly Ser Asn Pro Pro Lys Pro Leu Lys Lys Leu Arg Phe Asp He Glu 
770 775 780 



Gly Ser Asp Glu Ala Asp Gly Ser Lys His Leu Pro Gly Glu Ser Lys 
785 790 795 800 



Phe Gin Gin Lys Leu Ala Glu Met Thr Ser Thr Arg Thr Arg Met Gin 
805 810 815 



Lys Gin Lys Met Asn Asp Ser Met Asp Thr Ser Asn Lys Glu Glu Lys 
820 825 830 



(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3113 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 7.. 2349 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
GCCGTC ATG TCA AGA CTG TTG AAG AAG TAT GAT GTA TTG TTT GCA CTC 48 



Met Ser Arg Leu Leu Lys Lys 
1 5 



Tyr Asp Val Leu Phe Ala Leu 
10 



TTC AGC AAA TTG GAA AGG ACA TGT GAA CTT ATA TAT TTG ACA CAA CCC 
Phe Ser Lys Leu Glu Arg Thr Cys Glu Leu lie Tyr Leu Thr Gin Pro 
15 20 25 30 



96 



AGC AGT TCG ATA TCT ACT GAA ATA AAT TCT GCA TTG GTG CTA AAA GTT 
Ser Ser Ser lie Ser Thr Glu lie Asn Ser Ala Leu Val Leu Lys Val 
35 40 45 



144 



TCT TGG ATC ACA TTT TTA TTA GCT AAA GGG GAA GTA TTA CAA ATG GAA 
Ser Trp lie Thr Phe Leu Leu Ala Lys Gly Glu Val Leu Gin Met Glu 
50 55 60 



192 



GAT GAT CTG GTG ATT TCA TTT CAG TTA ATG CTA TGT GTC CTT GAC TAT 
Asp Asp Leu Val lie Ser Phe Gin Leu Met Leu Cys Val Leu Asp Tyr 
65 70 75 



240 



TTT ATT AAA CTC TCA CCT CCC ATG TTG CTC AAA GAA CCA TAT AAA ACA 
Phe lie Lys Leu Ser Pro Pro Met Leu Leu Lys Glu Pro Tyr Lys Thr 
80 85 90 



288 



GCT GTT ATA CCC ATT AAT GGT TCA CCT CGA ACA CCC AGG CGA GGT CAG 
Ala Val lie Pro He Asn Gly Ser Pro Arg Thr Pro Arg Arg Gly Gin 
95 100 105 110 



336 



AAC AGG AGT GCA CGG ATA GCA AAA CAA CTA GAA AAT GAT ACA AGA ATT 
Asn Arg Ser Ala Arg He Ala Lys Gin Leu Glu Asn Asp Thr Arg He 
115 120 125 



384 



ATT GAA GTT CTC TGT AAA GAA CAT GAA TGT AAT ATA GAT GAG GTG AAA 432 
lie Glu Val Leu Cys Lys Glu His Glu Cys Asn lie Asp Glu Val Lys 
130 135 140 

AAT GTT TAT TTC AAA AAT TTT ATA CCT TTT ATG AAT TCT CTT GGA CTT 480 
Asn Val Tyr Phe Lys Asn Phe lie Pro Phe Met Asn Ser Leu Gly Leu 
145 150 155 

GTA ACA TCT AAT GGA CTT CCA GAG GTT GAA AAT CTT TCT AAA CGA TAC 528 
Val Thr Ser Asn Gly Leu Pro Glu Val Glu Asn Leu Ser Lys Arg Tyr 
160 165 170 

GAA GAA ATT TAT CTT AAA AAT AAA GAT CTA GAT GCA AGA TTA TTT TTG 576 
Glu Glu lie Tyr Leu Lys Asn Lys Asp Leu Asp Ala Arg Leu Phe Leu 
175 180 185 190 

GAT CAT GAT AAA ACT CTT CAG ACT GAT TCT ATA GAC AGT TTT GAA ACA 624 
Asp His Asp Lys Thr Leu Gin Thr Asp Ser lie Asp Ser Phe Glu Thr 
195 200 205 

CAG AGA ACA CCA CGA AAA AGT AAC CTT GAT GAA GAG GTG AAT GTA ATT 672 
Gin Arg Thr Pro Arg Lys Ser Asn Leu Asp Glu Glu Val Asn Val lie 
210 215 220 

CCT CCA CAC ACT CCA GTT AGG ACT GTT ATG AAC ACT ATC CAA CAA TTA 720 
Pro Pro His Thr Pro Val Arg Thr Val Met Asn Thr lie Gin Gin Leu 
225 230 235 

ATG ATG ATT TTA AAT TCA GCA AGT GAT CAA CCT TCA GAA AAT CTG ATT 768 
Met Met lie Leu Asn Ser Ala Ser Asp Gin Pro Ser Glu Asn Leu lie 
240 245 250 

TCC TAT TTT AAC AAC TGC ACA GTG AAT CCA AAA GAA AGT ATA CTG AAA 816 
Ser Tyr Phe Asn Asn Cys Thr Val Asn Pro Lys Glu Ser lie Leu Lys 
255 260 265 270 

AGA GTG AAG GAT ATA GGA TAC ATC TTT AAA GAG AAA TTT GCT AAA GCT 864 
Arg Val Lys Asp lie Gly Tyr lie Phe Lys Glu Lys Phe Ala Lys Ala 
275 280 285 

GTG GGA CAG GGT TGT GTC GAA ATT GGA TCA CAG CGA TAC AAA CTT GGA 912 
Val Gly Gin Gly Cys Val Glu He Gly Ser Gin Arg Tyr Lys Leu Gly 
290 295 300 

GTT CGC TTG TAT TAC CGA GTA ATG GAA TCC ATG CTT AAA TCA GAA GAA 960 
Val Arg Leu Tyr Tyr Arg Val Met Glu Ser Met Leu Lys Ser Glu Glu 
305 310 315 

GAA CGA TTA TCC ATT CAA AAT TTT AGC AAA CTT CTG AAT GAC AAC ATT 1008 
Glu Arg Leu Ser He Gin Asn Phe Ser Lys Leu Leu Asn Asp Asn He 
320 325 330 



TTT CAT ATG TCT TTA TTG GCG TGC GCT CTT GAG GTT GTA ATG GCC ACA 1056 
Phe His Met Ser Leu Leu Ala Cys Ala Leu Glu Val Val Met Ala Thr 
335 340 345 350 

TAT AGC AGA AGT ACA TCT CAG AAT CTT GAT TCT GGA ACA GAT TTG TCT 1104 
Tyr Ser Arg Ser Thr Ser Gin Asn Leu Asp Ser Gly Thr Asp Leu Ser 
355 360 365 

TTC CCA TGG ATT CTG AAT GTG CTT AAT TTA AAA GCC TTT GAT TTT TAC 1152 
Phe Pro Trp lie Leu Asn Val Leu Asn Leu Lys Ala Phe Asp Phe Tyr 
370 375 380 

AAA GTG ATC GAA AGT TTT ATC AAA GCA GAA GGC AAC TTG ACA AGA GAA 12 00 

Lys Val lie Glu Ser Phe lie Lys Ala Glu Gly Asn Leu Thr Arg Glu 
385 390 395 

ATG ATA AAA CAT TTA GAA CGA TGT GAA CAT CGA ATC ATG GAA TCC CTT 1248 
Met lie Lys His Leu Glu Arg Cys Glu His Arg lie Met Glu Ser Leu 
400 405 410 

GCA TGG CTC TCA GAT TCA CCT TTA TTT GAT CTT ATT AAA CAA TCA AAG 1296 
Ala Trp Leu Ser Asp Ser Pro Leu Phe Asp Leu lie Lys Gin Ser Lys 
415 420 425 430 

GAC CGA GAA GGA CCA ACT GAT CAC CTT GAA TCT GCT TGT CCT CTT AAT 1344 
Asp Arg Glu Gly Pro Thr Asp His Leu Glu Ser Ala Cys Pro Leu Asn 
435 440 445 

CTT CCT CTC CAG AAT AAT CAC ACT GCA GCA GAT ATG TAT CTT TCT CCT 1392 
Leu Pro Leu Gin Asn Asn His Thr Ala Ala Asp Met Tyr Leu Ser Pro 
450 455 460 

GTA AGA TCT CCA AAG AAA AAA GGT TCA ACT ACG CGT GTA AAT TCT ACT 1440 
Val Arg Ser Pro Lys Lys Lys Gly Ser Thr Thr Arg Val Asn Ser Thr 
465 470 475 

GCA AAT GCA GAG ACA CAA GCA ACC TCA GCC TTC CAG ACC CAG AAG CCA 1488 
Ala Asn Ala Glu Thr Gin Ala Thr Ser Ala Phe Gin Thr Gin Lys Pro 
480 485 490 

TTG AAA TCT ACC TCT CTT TCA CTG TTT TAT AAA AAA GTG TAT CGG CTA 1536 
Leu Lys Ser Thr Ser Leu Ser Leu Phe Tyr Lys Lys Val Tyr Arg Leu 
495 500 505 510 

GCC TAT CTC CGG CTA AAT ACA CTT TGT GAA CGC CTT CTG TCT GAG CAC 1584 
Ala Tyr Leu Arg Leu Asn Thr Leu Cys Glu Arg Leu Leu Ser Glu His 
515 520 525 

CCA GAA TTA GAA CAT ATC ATC TGG ACC CTT TTC CAG CAC ACC CTG CAG 1632 
Pro Glu Leu Glu His He He Trp Thr Leu Phe Gin His Thr Leu Gin 
530 535 540 



AAT GAG TAT GAA CTC ATG AGA GAC AGG CAT TTG GAC CAA ATT ATG ATG 1680 
Asn Glu Tyr Glu Leu Met Arg Asp Arg His Leu Asp Gin lie Met Met 
545 550 555 

TGT TCC ATG TAT GGC ATA TGC AAA GTG AAG AAT ATA GAC CTT AAA TTC 1728 
Cys Ser Met Tyr Gly lie Cys Lys Val Lys Asn lie Asp Leu Lys Phe 
560 565 570 

AAA ATC ATT GTA ACA GCA TAC AAG GAT CTT CCT CAT GCT GTT CAG GAG 1776 
Lys lie lie Val Thr Ala Tyr Lys Asp Leu Pro His Ala Val Gin Glu 
575 580 585 590 

ACA TTC AAA CGT GTT TTG ATC AAA GAA GAG GAG TAT GAT TCT ATT ATA 1824 
Thr Phe Lys Arg Val Leu lie Lys Glu Glu Glu Tyr Asp Ser lie lie 
595 600 605 

GTA TTC TAT AAC TCG GTC TTC ATG CAG AGA CTG AAA ACA AAT ATT TTG 1872 
Val Phe Tyr Asn Ser Val Phe Met Gin Arg Leu Lys Thr Asn lie Leu 
610 615 620 

CAG TAT GCT TCC ACC AGG CCC CCT ACC TTG TCA CCA ATA CCT CAC ATT 1920 
Gin Tyr Ala Ser Thr Arg Pro Pro Thr Leu Ser Pro Tie Pro His lie 
625 630 635 

CCT CGA AGC CCT TAC AAG TTT CCT AGT TCA CCC TTA CGG ATT CCT GGA 1968 
Pro Arg Ser Pro Tyr Lys Phe Pro Ser Ser Pro Leu Arg lie Pro Gly 
640 645 650 

GGG AAC ATC TAT ATT TCA CCC CTG AAG AGT CCA TAT AAA ATT TCA GAA 2 016 

Gly Asn lie Tyr lie Ser Pro Leu Lys Ser Pro Tyr Lys lie Ser Glu 
655 660 665 670 

GGT CTG CCA ACA CCA ACA AAA ATG ACT CCA AGA TCA AGA ATC TTA GTA 2064 
Gly Leu Pro Thr Pro Thr Lys Met Thr Pro Arg Ser Arg lie Leu Val 
675 680 685 

TCA ATT GGT GAA TCA TTC GGG ACT TCT GAG AAG TTC CAG AAA ATA AAT 2112 
Ser lie Gly Glu Ser Phe Gly Thr Ser Glu Lys Phe Gin Lys lie Asn 
690 695 700 

CAG ATG GTA TGT AAC AGC GAC CGT GTG CTC AAA AGA AGT GCT GAA GGA 2160 
Gin Met Val Cys Asn Ser Asp Arg Val Leu Lys Arg Ser Ala Glu Gly 
705 710 715 

AGC AAC CCT CCT AAA CCA CTG AAA AAA CTA CGC TTT GAT ATT GAA GGA 2208 
Ser Asn Pro Pro Lys Pro Leu Lys Lys Leu Arg Phe Asp lie Glu Gly 
720 725 730 

TCA GAT GAA GCA GAT GGA AGT AAA CAT CTC CCA GGA GAG TCC AAA TTT 2256 
Ser Asp Glu Ala Asp Gly Ser Lys His Leu Pro Gly Glu Ser Lys Phe 
735 740 745 750 



CAG CAG AAA CTG GCA GAA ATG ACT TCT ACT CGA ACA CGA ATG CAA AAG 2304 
Gin Gin Lys Leu Ala Glu Met Thr Ser Thr Arg Thr Arg Met Gin Lys 
755 760 765 

CAG AAA ATG AAT GAT AGC ATG GAT ACC TCA AAC AAG GAA GAG AAA 2349 
Gin Lys Met Asn Asp Ser Met Asp Thr Ser Asn Lys Glu Glu Lys 
770 775 780 

TGAGGATCTC AGGACCTTGG TGGACACTGT GTACACCTCT GGATTCATTG TCTCTCACAG 2409 

ATGTGACTGT ATAACTTTCC CAGGTTCTGT TTATGGCCAC ATTTAATATC TTCAGCTCTT 2469 

TTTGTGGATA TAAAATGTGC AGATGCAATT GTTTGGGTGA TTCCTAAGCC ACTTGAAATG 2529 

TTAGTCATTG TTATTTATAC AAGATTGAAA ATCTTGTGTA AATCCTGCCA TTTAAAAAGT 2589 

TGTAGCAGAT TGTTTCCTCT TCCAAAGTAA AATTGCTGTG CTTTATGGAT AGTAAGAATG 2649 

GCCCTAGAGT GGGAGTCCTG ATAACCCAGG CCTGTCTGAC TACTTTGCCT TCTTTTGTAG 2709 

CATATAGGTG ATGTTTGCTC TTGTTTTTAT TAATTTATAT GTATATTTTT TTAATTTAAC 2769 

ATGAACACCC TTAGAAAATG TGTCCTATCT ATCTTCCAAA TGCAATTTGA TTGACTGCCC 2829 

ATTCACCAAA ATTATCCTGA ACTCTTCTGC AAAAATGGAT ATTATTAGAA ATTAGAAAAA 2889 

AATTACTAAT TTTACACATT AGATTTTATT TTACTATTGG AATCTGATAT ACTGTGTGCT 2 94 9 

TGTTTTATAA AATTTTGCTT TTAATTAAAT AAAAGCTGGA AGCAAAGTAT AACCATATGA 3009 

TACTATCATA CTACTGAAAC AGATTTCATA CCTCAGAATG TAAAAGAACT TACTGATTAT 3069 

TTTCTTCATC CAACTTATGT TTTTAAATGA GGATTATTGA TAGT 3113 



(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 781 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

Met Ser Arg Leu Leu Lys Lys Tyr Asp Val Leu Phe Ala Leu Phe Ser 
15 10 15 

Lys Leu Glu Arg Thr Cys Glu Leu lie Tyr Leu Thr Gin Pro Ser Ser 
20 25 30 



Ser lie Ser Thr Glu lie Asn Ser Ala Leu Val Leu Lys Val Ser Trp 
35 40 45 



lie Thr Phe Leu Leu Ala Lys Gly Glu Val Leu Gin Met Glu Asp Asp 
50 55 60 



Leu Val lie Ser Phe Gin Leu Met Leu Cys Val Leu Asp Tyr Phe lie 
65 70 75 80 

Lys Leu Ser Pro Pro Met Leu Leu Lys Glu Pro Tyr Lys Thr Ala Val 
85 90 95 

lie Pro lie Asn Gly Ser Pro Arg Thr Pro Arg Arg Gly Gin Asn Arg 
100 105 110 

Ser Ala Arg lie Ala Lys Gin Leu Glu Asn Asp Thr Arg lie lie Glu 
115 120 125 

Val Leu Cys Lys Glu His Glu Cys Asn lie Asp Glu Val Lys Asn Val 
130 135 140 

Tyr Phe Lys Asn Phe lie Pro Phe Met Asn Ser Leu Gly Leu Val Thr 
145 150 155 160 

Ser Asn Gly Leu Pro Glu Val Glu Asn Leu Ser Lys Arg Tyr Glu Glu 
165 170 175 

lie Tyr Leu Lys Asn Lys Asp Leu Asp Ala Arg Leu Phe Leu Asp His 
180 185 190 

Asp Lys Thr Leu Gin Thr Asp Ser lie Asp Ser Phe Glu Thr Gin Arg 
195 200 205 

Thr Pro Arg Lys Ser Asn Leu Asp Glu Glu Val Asn Val lie Pro Pro 
210 215 220 

His Thr Pro Val Arg Thr Val Met Asn Thr lie Gin Gin Leu Met Met 
225 230 235 240 

lie Leu Asn Ser Ala Ser Asp Gin Pro Ser Glu Asn Leu lie Ser Tyr 
245 250 255 

Phe Asn Asn Cys Thr Val Asn Pro Lys Glu Ser lie Leu Lys Arg Val 
260 265 270 

Lys Asp lie Gly Tyr lie Phe Lys Glu Lys Phe Ala Lys Ala Val Gly 
275 280 285 

Gin Gly Cys Val Glu lie Gly Ser Gin Arg Tyr Lys Leu Gly Val Arg 
290 295 300 

Leu Tyr Tyr Arg Val Met Glu Ser Met Leu Lys Ser Glu Glu Glu Arg 
305 310 315 320 



Leu Ser lie Gin Asn Phe Ser Lys Leu Leu Asn Asp Asn lie Phe His 
325 330 335 



Met Ser Leu Leu Ala Cys Ala Leu Glu Val Val Met Ala Thr Tyr Ser 
340 345 350 



Arg Ser Thr Ser Gin Asn Leu Asp Ser Gly Thr Asp Leu Ser Phe Pro 
355 360 365 

Trp lie Leu Asn Val Leu Asn Leu Lys Ala Phe Asp Phe Tyr Lys Val 
370 375 380 

lie Glu Ser Phe lie Lys Ala Glu Gly Asn Leu Thr Arg Glu Met lie 
385 390 395 400 

Lys His Leu Glu Arg Cys Glu His Arg He Met Glu Ser Leu Ala Trp 
405 410 415 

Leu Ser Asp Ser Pro Leu Phe Asp Leu He Lys Gin Ser Lys Asp Arg 
420 425 430 

Glu Gly Pro Thr Asp His Leu Glu Ser Ala Cys Pro Leu Asn Leu Pro 
435 440 445 

Leu Gin Asn Asn His Thr Ala Ala Asp Met Tyr Leu Ser Pro Val Arg 
450 455 460 

Ser Pro Lys Lys Lys Gly Ser Thr Thr Arg Val Asn Ser Thr Ala Asn 
465 470 475 480 

Ala Glu Thr Gin Ala Thr Ser Ala Phe Gin Thr Gin Lys Pro Leu Lys 
485 490 495 

Ser Thr Ser Leu Ser Leu Phe Tyr Lys Lys Val Tyr Arg Leu Ala Tyr 
500 505 510 

Leu Arg Leu Asn Thr Leu Cys Glu Arg Leu Leu Ser Glu His Pro Glu 
515 520 525 

Leu Glu His He He Trp Thr Leu Phe Gin His Thr Leu Gin Asn Glu 
530 535 540 

Tyr Glu Leu Met Arg Asp Arg His Leu Asp Gin He Met Met Cys Ser 
545 550 555 560 

Met Tyr Gly He Cys Lys Val Lys Asn He Asp Leu Lys Phe Lys He 
565 570 575 

He Val Thr Ala Tyr Lys Asp Leu Pro His Ala Val Gin Glu Thr Phe 
580 585 590 

Lys Arg Val Leu He Lys Glu Glu Glu Tyr Asp Ser He He Val Phe 
595 600 605 



Tyr Asn Ser Val Phe Met Gin Arg Leu Lys Thr Asn He Leu Gin Tyr 
610 615 620 



Ala Ser Thr Arg 
625 

Ser Pro Tyr Lys 



lie Tyr lie Ser 
660 

Pro Thr Pro Thr 
675 

Gly Glu Ser Phe 
690 

Val Cys Asn Ser 
705 

Pro Pro Lys Pro 



Glu Ala Asp Gly 
740 

Lys Leu Ala Glu 
755 

Met Asn Asp Ser 
770 



Pro Pro Thr Leu 
630 

Phe Pro Ser Ser 
645 

Pro Leu Lys Ser 



Lys Met Thr Pro 
680 

Gly Thr Ser Glu 
695 

Asp Arg Val Leu 
710 

Leu Lys Lys Leu 
725 

Ser Lys His Leu 



Met Thr Ser Thr 
760 

Met Asp Thr Ser 
775 



Ser Pro lie Pro 
635 

Pro Leu Arg lie 
650 

Pro Tyr Lys lie 
665 

Arg Ser Arg lie 



Lys Phe Gin Lys 
700 

Lys Arg Ser Ala 
715 

Arg Phe Asp lie 
730 

Pro Gly Glu Ser 
745 

Arg Thr Arg Met 



Asn Lys Glu Glu 
780 



His lie Pro Arg 
640 

Pro Gly Gly Asn 
655 

Ser Glu Gly Leu 
670 

Leu Val Ser lie 
685 

lie Asn Gin Met 



Glu Gly Ser Asn 
720 

Glu Gly Ser Asp 
735 

Lys Phe Gin Gin 
750 

Gin Lys Gin Lys 
765 

Lys 



(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3323 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 7.. 2559 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

CGCGTC ATG CCG CCC AAA ACC CCC CGA AAA ACG GCC GCC ACC GCC GCC 48 
Met Pro Pro Lys Thr Pro Arg Lys Thr Ala Ala Thr Ala Ala 
15 10 



GCT GCC GCC GCG GAA CCC CCG GCA CCG CCG CCG CCG CCC CCT CCT GAG 
Ala Ala Ala Ala Glu Pro Pro Ala Pro Pro Pro Pro Pro Pro Pro Glu 
15 20 25 30 



96 



GTC GAC CTA GAT GAG ATG TCG TTC ACT TTT ACT GAG CTA CAG AAA AAC 144 
Val Asp Leu Asp Glu Met Ser Phe Thr Phe Thr Glu Leu Gin Lys Asn 
35 40 45 

ATA GAA ATC AGT GTC CAT AAA TTC TTT AAC TTA CTA AAA GAA ATT GAT 192 
lie Glu lie Ser Val His Lys Phe Phe Asn Leu Leu Lys Glu lie Asp 
50 55 60 

ACC AGT ACC AAA GTT GAT AAT GCT ATG TCA AGA CTG TTG AAG AAG TAT 240 
Thr Ser Thr Lys Val Asp Asn Ala Met Ser Arg Leu Leu Lys Lys Tyr 
65 70 75 

GAT GTA TTG TTT GCA CTC TTC AGC AAA TTG GAA AGG ACA TGT GAA CTT 288 
Asp Val Leu Phe Ala Leu Phe Ser Lys Leu Glu Arg Thr Cys Glu Leu 
80 85 90 

ATA TAT TTG ACA CAA CCC AGC AGT TCG ATA TCT ACT GAA ATA AAT TCT 336 
lie Tyr Leu Thr Gin Pro Ser Ser Ser lie Ser Thr Glu lie Asn Ser 
95 100 105 110 

GCA TTG GTG CTA AAA GTT TCT TGG ATC ACA TTT TTA TTA GCT AAA GGG 384 
Ala Leu Val Leu Lys Val Ser Trp lie Thr Phe Leu Leu Ala Lys Gly 
115 120 125 

GAA GTA TTA CAA ATG GAA GAT GAT CTG GTG ATT TCA TTT CAG TTA ATG 432 
Glu Val Leu Gin Met Glu Asp Asp Leu Val lie Ser Phe Gin Leu Met 
130 135 140 

CTA TGT GTC CTT GAC TAT TTT ATT AAA CTC TCA CCT CCC ATG TTG CTC 480 
Leu Cys Val Leu Asp Tyr Phe lie Lys Leu Ser Pro Pro Met Leu Leu 
145 150 155 

AAA GAA CCA TAT AAA ACA GCT GTT ATA CCC ATT AAT GGT TCA CCT CGA 528 
Lys Glu Pro Tyr Lys Thr Ala Val lie Pro lie Asn Gly Ser Pro Arg 
160 165 170 

ACA CCC AGG CGA GGT CAG AAC AGG AGT GCA CGG ATA GCA AAA CAA CTA 576 
Thr Pro Arg Arg Gly Gin Asn Arg Ser Ala Arg lie Ala Lys Gin Leu 
175 180 185 190 

GAA AAT GAT ACA AGA ATT ATT GAA GTT CTC TGT AAA GAA CAT GAA TGT 624 
Glu Asn Asp Thr Arg lie lie Glu Val Leu Cys Lys Glu His Glu Cys 
195 200 205 

AAT ATA GAT GAG GTG AAA AAT GTT TAT TTC AAA AAT TTT ATA CCT TTT 672 
Asn lie Asp Glu Val Lys Asn Val Tyr Phe Lys Asn Phe lie Pro Phe 
210 215 220 

ATG AAT TCT CTT GGA CTT GTA ACA TCT AAT GGA CTT CCA GAG GTT GAA 720 
Met Asn Ser Leu Gly Leu Val Thr Ser Asn Gly Leu Pro Glu Val Glu 
225 230 235 



AAT CTT TCT AAA CGA TAC GAA GAA ATT TAT CTT AAA AAT AAA GAT CTA 768 
Asn Leu Ser Lys Arg Tyr Glu Glu lie Tyr Leu Lys Asn Lys Asp Leu 
240 245 250 

GAT GCA AGA TTA TTT TTG GAT CAT GAT AAA ACT CTT CAG ACT GAT TCT 816 
Asp Ala Arg Leu Phe Leu Asp His Asp Lys Thr Leu Gin Thr Asp Ser 
255 260 265 270 

ATA GAC AGT TTT GAA ACA CAG AGA ACA CCA CGA AAA AGT AAC CTT GAT 864 
lie Asp Ser Phe Glu Thr Gin Arg Thr Pro Arg Lys Ser Asn Leu Asp 
275 280 285 

GAA GAG GTG AAT GTA ATT CCT CCA CAC ACT CCA GTT AGG ACT GTT ATG 912 
Glu Glu Val Asn Val lie Pro Pro His Thr Pro Val Arg Thr Val Met 
290 295 300 

AAC ACT ATC CAA CAA TTA ATG ATG ATT TTA AAT TCA GCA AGT GAT CAA 960 
Asn Thr lie Gin Gin Leu Met Met lie Leu Asn Ser Ala Ser Asp Gin 
305 310 315 

CCT TCA GAA AAT CTG ATT TCC TAT TTT AAC AAC TGC ACA GTG AAT CCA 1008 
Pro Ser Glu Asn Leu lie Ser Tyr Phe Asn Asn Cys Thr Val Asn Pro 
320 325 330 

AAA GAA AGT ATA CTG AAA AGA GTG AAG GAT ATA GGA TAC ATC TTT AAA 1056 
Lys Glu Ser lie Leu Lys Arg Val Lys Asp lie Gly Tyr lie Phe Lys 
335 340 345 350 

GAG AAA TTT GCT AAA GCT GTG GGA CAG GGT TGT GTC GAA ATT GGA TCA 1104 
Glu Lys Phe Ala Lys Ala Val Gly Gin Gly Cys Val Glu lie Gly Ser 
355 360 365 

CAG CGA TAC AAA CTT GGA GTT CGC TTG TAT TAC CGA GTA ATG GAA TCC 1152 
Gin Arg Tyr Lys Leu Gly Val Arg Leu Tyr Tyr Arg Val Met Glu Ser 
370 375 380 

ATG CTT AAA TCA GAA GAA GAA CGA TTA TCC ATT CAA AAT TTT AGC AAA 1200 
Met Leu Lys Ser Glu Glu Glu Arg Leu Ser lie Gin Asn Phe Ser Lys 
385 390 395 

CTT CTG AAT GAC AAC ATT TTT CAT ATG TCT TTA TTG GCG TGC GCT CTT 1248 
Leu Leu Asn Asp Asn lie Phe His Met Ser Leu Leu Ala Cys Ala Leu 
400 405 410 

GAG GTT GTA ATG GCC ACA TAT AGC AGA AGT ACA TCT CAG AAT CTT GAT 12 96 

Glu Val Val Met Ala Thr Tyr Ser Arg Ser Thr Ser Gin Asn Leu Asp 
415 420 425 430 

TCT GGA ACA GAT TTG TCT TTC CCA TGG ATT CTG AAT GTG CTT AAT TTA 1344 
Ser Gly Thr Asp Leu Ser Phe Pro Trp He Leu Asn Val Leu Asn Leu 
435 440 445 



AAA GCC TTT GAT TTT TAC AAA GTG ATC GAA AGT TTT ATC AAA GCA GAA 1392 
Lys Ala Phe Asp Phe Tyr Lys Val lie Glu Ser Phe lie Lys Ala Glu 
450 455 460 

GGC AAC TTG ACA AGA GAA ATG ATA AAA CAT TTA GAA CGA TGT GAA CAT 1440 
Gly Asn Leu Thr Arg Glu Met lie Lys His Leu Glu Arg Cys Glu His 
465 470 475 

CGA ATC ATG GAA TCC CTT GCA TGG CTC TCA GAT TCA CCT TTA TTT GAT 1488 
Arg lie Met Glu Ser Leu Ala Trp Leu Ser Asp Ser Pro Leu Phe Asp 
480 485 490 

CTT ATT AAA CAA TCA AAG GAC CGA GAA GGA CCA ACT GAT CAC CTT GAA 1536 
Leu lie Lys Gin Ser Lys Asp Arg Glu Gly Pro Thr Asp His Leu Glu 
495 500 505 510 

TCT GCT TGT CCT CTT AAT CTT CCT CTC CAG AAT AAT CAC ACT GCA GCA 1584 
Ser Ala Cys Pro Leu Asn Leu Pro Leu Gin Asn Asn His Thr Ala Ala 
515 520 525 

GAT ATG TAT CTT TCT CCT GTA AGA TCT CCA AAG AAA AAA GGT TCA ACT 1632 
Asp Met Tyr Leu Ser Pro Val Arg Ser Pro Lys Lys Lys Gly Ser Thr 
530 535 540 

ACG CGT GTA AAT TCT ACT GCA AAT GCA GAG ACA CAA GCA ACC TCA GCC 1680 
Thr Arg Val Asn Ser Thr Ala Asn Ala Glu Thr Gin Ala Thr Ser Ala 
545 550 555 

TTC CAG ACC CAG AAG CCA TTG AAA TCT ACC TCT CTT TCA CTG TTT TAT 1728 
Phe Gin Thr Gin Lys Pro Leu Lys Ser Thr Ser Leu Ser Leu Phe Tyr 
560 565 570 

AAA AAA GTG TAT CGG CTA GCC TAT CTC CGG CTA AAT ACA CTT TGT GAA 1776 
Lys Lys Val Tyr Arg Leu Ala Tyr Leu Arg Leu Asn Thr Leu Cys Glu 
575 580 585 590 

CGC CTT CTG TCT GAG CAC CCA GAA TTA GAA CAT ATC ATC TGG ACC CTT 1824 
Arg Leu Leu Ser Glu His Pro Glu Leu Glu His lie lie Trp Thr Leu 
595 600 605 

TTC CAG CAC ACC CTG CAG AAT GAG TAT GAA CTC ATG AGA GAC AGG CAT 18 72 

Phe Gin His Thr Leu Gin Asn Glu Tyr Glu Leu Met Arg Asp Arg His 
610 615 620 

TTG GAC CAA ATT ATG ATG TGT TCC ATG TAT GGC ATA TGC AAA GTG AAG 1920 
Leu Asp Gin lie Met Met Cys Ser Met Tyr Gly lie Cys Lys Val Lys 
625 630 635 

AAT ATA GAC CTT AAA TTC AAA ATC ATT GTA ACA GCA TAC AAG GAT CTT 1968 
Asn lie Asp Leu Lys Phe Lys lie lie Val Thr Ala Tyr Lys Asp Leu 
640 645 650 



CCT CAT GCT GTT CAG GAG ACA TTC AAA CGT GTT TTG ATC AAA GAA GAG 2016 
Pro His Ala Val Gin Glu Thr Phe Lys Arg Val Leu He Lys Glu Glu 
655 660 665 670 

GAG TAT GAT TCT ATT ATA GTA TTC TAT AAC TCG GTC TTC ATG CAG AGA 2064 
Glu Tyr Asp Ser He He Val Phe Tyr Asn Ser Val Phe Met Gin Arg 
675 680 685 

CTG AAA ACA AAT ATT TTG CAG TAT GCT TCC ACC AGG CCC CCT ACC TTG 2112 
Leu Lys Thr Asn He Leu Gin Tyr Ala Ser Thr Arg Pro Pro Thr Leu 
690 695 700 

TCA CCA ATA CCT CAC ATT CCT CGA AGC CCT TAC AAG TTT CCT AGT TCA 216 0 

Ser Pro He Pro His He Pro Arg Ser Pro Tyr Lys Phe Pro Ser Ser 
705 710 715 

CCC TTA CGG ATT CCT GGA GGG AAC ATC TAT ATT TCA CCC CTG AAG AGT 2208 
Pro Leu Arg He Pro Gly Gly Asn He Tyr He Ser Pro Leu Lys Ser 
720 725 730 

CCA TAT AAA ATT TCA GAA GGT CTG CCA ACA CCA ACA AAA ATG ACT CCA 2256 
Pro Tyr Lys He Ser Glu Gly Leu Pro Thr Pro Thr Lys Met Thr Pro 
735 740 745 750 

AGA TCA AGA ATC TTA GTA TCA ATT GGT GAA TCA TTC GGG ACT TCT GAG 23 04 

Arg Ser Arg He Leu Val Ser He Gly Glu Ser Phe Gly Thr Ser Glu 
755 760 765 

AAG TTC CAG AAA ATA AAT CAG ATG GTA TGT AAC AGC GAC CGT GTG CTC 2352 
Lys Phe Gin Lys He Asn Gin Met Val Cys Asn Ser Asp Arg Val Leu 
770 775 780 

AAA AGA AGT GCT GAA GGA AGC AAC CCT CCT AAA CCA CTG AAA AAA CTA 24 00 

Lys Arg Ser Ala Glu Gly Ser Asn Pro Pro Lys Pro Leu Lys Lys Leu 
785 790 795 

CGC TTT GAT ATT GAA GGA TCA GAT GAA GCA GAT GGA AGT AAA CAT CTC 2448 
Arg Phe Asp He Glu Gly Ser Asp Glu Ala Asp Gly Ser Lys His Leu 
800 805 810 

CCA GGA GAG TCC AAA TTT CAG CAG AAA CTG GCA GAA ATG ACT TCT ACT 2496 
Pro Gly Glu Ser Lys Phe Gin Gin Lys Leu Ala Glu Met Thr Ser Thr 
815 820 825 830 

CGA ACA CGA ATG CAA AAG CAG AAA ATG AAT GAT AGC ATG GAT ACC TCA 2544 
Arg Thr Arg Met Gin Lys Gin Lys Met Asn Asp Ser Met Asp Thr Ser 
835 840 845 

AAC AAG GAA GAG AAA TGAGGATCTC AGGACCTTGG TGGACACTGT GTACACCTCT 2599 
Asn Lys Glu Glu Lys 
850 



GGATTCATTG TCTCTCACAG ATGTGACTGT ATAACTTTCC CAGGTTCTGT TTATGGCCAC 2659 



ATTTAATATC TTCAGCTCTT TTTGTGGATA TAAAATGTGC AGATGCAATT GTTTGGGTGA 2719 

TTCCTAAGCC ACTTGAAATG TTAGTCATTG TTATTTATAC AAGATTGAAA ATCTTGTGTA 2779 

AATCCTGCCA TTTAAAAAGT TGTAGCAGAT TGTTTCCTCT TCCAAAGTAA AATTGCTGTG 283 9 

CTTTATGGAT AGTAAGAATG GCCCTAGAGT GGGAGTCCTG ATAACCCAGG CCTGTCTGAC 2 899 

TACTTTGCCT TCTTTTGTAG CATATAGGTG ATGTTTGCTC TTGTTTTTAT TAATTTATAT 2 959 

GTATATTTTT TTAATTTAAC ATGAACACCC TTAGAAAATG TGTCCTATCT ATCTTCCAAA 3019 

TGCAATTTGA TTGACTGCCC ATTCACCAAA ATTATCCTGA ACTCTTCTGC AAAAATGGAT 3079 

ATTATTAGAA ATTAGAAAAA AATTACTAAT TTTACACATT AGATTTTATT TTACTATTGG 313 9 

AATCTGATAT ACTGTGTGCT TGTTTTATAA AATTTTGCTT TTAATTAAAT AAAAGCTGGA 3199 

AGCAAAGTAT AACCATATGA TACTATCATA CTACTGAAAC AGATTTCATA CCTCAGAATG 3259 

TAAAAGAACT TACTGATTAT TTTCTTCATC CAACTTATGT TTTTAAATGA GGATTATTGA 3319 

TAGT 3323 

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 851 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

Met Pro Pro Lys Thr Pro Arg Lys Thr Ala Ala Thr Ala Ala Ala Ala 
15 10 15 

Ala Ala Glu Pro Pro Ala Pro Pro Pro Pro Pro Pro Pro Glu Val Asp 
20 25 30 

Leu Asp Glu Met Ser Phe Thr Phe Thr Glu Leu Gin Lys Asn lie Glu 
35 40 45 

lie Ser Val His Lys Phe Phe Asn Leu Leu Lys Glu lie Asp Thr Ser 
50 55 60 

Thr Lys Val Asp Asn Ala Met Ser Arg Leu Leu Lys Lys Tyr Asp Val 
65 70 75 80 

Leu Phe Ala Leu Phe Ser Lys Leu Glu Arg Thr Cys Glu Leu lie Tyr 
85 90 95 



Leu Thr Gin Pro Ser Ser Ser lie Ser Thr Glu lie Asn Ser Ala Leu 
100 105 110 

Val Leu Lys Val Ser Trp lie Thr Phe Leu Leu Ala Lys Gly Glu Val 
115 120 125 

Leu Gin Met Glu Asp Asp Leu Val lie Ser Phe Gin Leu Met Leu Cys 
130 135 140 

Val Leu Asp Tyr Phe lie Lys Leu Ser Pro Pro Met Leu Leu Lys Glu 
145 150 155 160 

Pro Tyr Lys Thr Ala Val lie Pro lie Asn Gly Ser Pro Arg Thr Pro 
165 170 175 

Arg Arg Gly Gin Asn Arg Ser Ala Arg lie Ala Lys Gin Leu Glu Asn 
180 185 190 

Asp Thr Arg lie lie Glu Val Leu Cys Lys Glu His Glu Cys Asn lie 
195 200 205 

Asp Glu Val Lys Asn Val Tyr Phe Lys Asn Phe lie Pro Phe Met Asn 
210 215 220 

Ser Leu Gly Leu Val Thr Ser Asn Gly Leu Pro Glu Val Glu Asn Leu 
225 230 235 240 

Ser Lys Arg Tyr Glu Glu lie Tyr Leu Lys Asn Lys Asp Leu Asp Ala 
245 250 255 

Arg Leu Phe Leu Asp His Asp Lys Thr Leu Gin Thr Asp Ser lie Asp 
260 265 270 

Ser Phe Glu Thr Gin Arg Thr Pro Arg Lys Ser Asn Leu Asp Glu Glu 
275 280 285 

Val Asn Val lie Pro Pro His Thr Pro Val Arg Thr Val Met Asn Thr 
290 295 300 

lie Gin Gin Leu Met Met lie Leu Asn Ser Ala Ser Asp Gin Pro Ser 
305 310 315 320 

Glu Asn Leu lie Ser Tyr Phe Asn Asn Cys Thr Val Asn Pro Lys Glu 
325 330 335 

Ser lie Leu Lys Arg Val Lys Asp lie Gly Tyr lie Phe Lys Glu Lys 
340 345 350 

Phe Ala Lys Ala Val Gly Gin Gly Cys Val Glu lie Gly Ser Gin Arg 
355 360 365 

Tyr Lys Leu Gly Val Arg Leu Tyr Tyr Arg Val Met Glu Ser Met Leu 
370 375 380 



Lys Ser Glu Glu Glu Arg Leu Ser lie Gin Asn Phe Ser Lys Leu Leu 
385 390 395 400 



Asn Asp Asn lie Phe His Met Ser Leu Leu Ala Cys Ala Leu Glu Val 
405 410 415 

Val Met Ala Thr Tyr Ser Arg Ser Thr Ser Gin Asn Leu Asp Ser Gly 
420 425 430 

Thr Asp Leu Ser Phe Pro Trp lie Leu Asn Val Leu Asn Leu Lys Ala 
435 440 445 

Phe Asp Phe Tyr Lys Val lie Glu Ser Phe lie Lys Ala Glu Gly Asn 
450 455 460 

Leu Thr Arg Glu Met lie Lys His Leu Glu Arg Cys Glu His Arg lie 
465 470 475 480 

Met Glu Ser Leu Ala Trp Leu Ser Asp Ser Pro Leu Phe Asp Leu lie 
485 490 495 

Lys Gin Ser Lys Asp Arg Glu Gly Pro Thr Asp His Leu Glu Ser Ala 
500 505 510 

Cys Pro Leu Asn Leu Pro Leu Gin Asn Asn His Thr Ala Ala Asp Met 
515 520 525 

Tyr Leu Ser Pro Val Arg Ser Pro Lys Lys Lys Gly Ser Thr Thr Arg 
530 535 540 

Val Asn Ser Thr Ala Asn Ala Glu Thr Gin Ala Thr Ser Ala Phe Gin 
545 550 555 560 

Thr Gin Lys Pro Leu Lys Ser Thr Ser Leu Ser Leu Phe Tyr Lys Lys 
565 570 575 

Val Tyr Arg Leu Ala Tyr Leu Arg Leu Asn Thr Leu Cys Glu Arg Leu 
580 585 590 

Leu Ser Glu His Pro Glu Leu Glu His lie lie Trp Thr Leu Phe Gin 
595 600 605 

His Thr Leu Gin Asn Glu Tyr Glu Leu Met Arg Asp Arg His Leu Asp 
610 615 620 

Gin lie Met Met Cys Ser Met Tyr Gly He Cys Lys Val Lys Asn He 
625 630 635 640 

Asp Leu Lys Phe Lys He He Val Thr Ala Tyr Lys Asp Leu Pro His 
645 650 655 



Ala Val Gin Glu Thr Phe Lys Arg Val Leu He Lys Glu Glu Glu Tyr 
660 665 670 



Asp Ser lie lie Val Phe Tyr Asn Ser Val Phe Met Gin Arg Leu Lys 
675 680 685 



Thr Asn lie Leu Gin Tyr Ala Ser Thr Arg Pro Pro Thr Leu Ser Pro 
690 695 700 

lie Pro His lie Pro Arg Ser Pro Tyr Lys Phe Pro Ser Ser Pro Leu 
705 710 715 720 



Arg lie Pro Gly 



Lys lie Ser Glu 
740 

Arg lie Leu Val 
755 

Gin Lys lie Asn 
770 

Ser Ala Glu Gly 
785 

Asp lie Glu Gly 



Glu Ser Lys Phe 
820 

Arg Met Gin Lys 
835 



Gly Asn lie Tyr 
725 

Gly Leu Pro Thr 



Ser lie Gly Glu 
760 

Gin Met Val Cys 
775 

Ser Asn Pro Pro 
790 

Ser Asp Glu Ala 
805 

Gin Gin Lys Leu 



Gin Lys Met Asn 
840 



lie Ser Pro Leu 
730 

Pro Thr Lys Met 
745 

Ser Phe Gly Thr 



Asn Ser Asp Arg 
780 

Lys Pro Leu Lys 
795 

Asp Gly Ser Lys 
810 

Ala Glu Met Thr 
825 

Asp Ser Met Asp 



Lys Ser Pro Tyr 
735 

Thr Pro Arg Ser 
750 

Ser Glu Lys Phe 
765 

Val Leu Lys Arg 



Lys Leu Arg Phe 
800 

His Leu Pro Gly 
815 

Ser Thr Arg Thr 
830 

Thr Ser Asn Lys 
845 



Glu Glu Lys 
850 



(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3461 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ix) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 7.. 2697 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:40: 

CGCGTC ATG CCG CCC AAA ACC CCC CGA AAA ACG GCC GCC ACC GCC GCC 
Met Pro Pro Lys Thr Pro Arg Lys Thr Ala Ala Thr Ala Ala 
15 10 



GCT GCC GCC GCG GAA CCC CCG GCA CCG CCG CCG CCG CCC CCT CCT GAG 96 
Ala Ala Ala Ala Glu Pro Pro Ala Pro Pro Pro Pro Pro Pro Pro Glu 
15 20 25 30 

GAG GAC CCA GAG CAG GAC AGC GGC CCG GAG GAC CTG CCT CTC GTC AGG 144 
Glu Asp Pro Glu Gin Asp Ser Gly Pro Glu Asp Leu Pro Leu Val Arg 
35 40 45 

CTT GAG TTT GAA GAA ACA GAA GAA CCT GAT TTT ACT GCA TTA TGT CAG 192 
Leu Glu Phe Glu Glu Thr Glu Glu Pro Asp Phe Thr Ala Leu Cys Gin 
50 55 60 

AAA TTA AAG ATA CCA GAT CAT GTC AGA GAG AGA GCT TGG TTG GTC GAC 240 
Lys Leu Lys lie Pro Asp His Val Arg Glu Arg Ala Trp Leu Val Asp 
65 70 75 

CTA GAT GAG ATG TCG TTC ACT TTT ACT GAG CTA CAG AAA AAC ATA GAA 288 
Leu Asp Glu Met Ser Phe Thr Phe Thr Glu Leu Gin Lys Asn lie Glu 
80 85 90 

ATC AGT GTC CAT AAA TTC TTT AAC TTA CTA AAA GAA ATT GAT ACC AGT 336 
lie Ser Val His Lys Phe Phe Asn Leu Leu Lys Glu lie Asp Thr Ser 
95 100 105 110 

ACC AAA GTT GAT AAT GCT ATG TCA AGA CTG TTG AAG AAG TAT GAT GTA 384 
Thr Lys Val Asp Asn Ala Met Ser Arg Leu Leu Lys Lys Tyr Asp Val 
115 120 125 

TTG TTT GCA CTC TTC AGC AAA TTG GAA AGG ACA TGT GAA CTT ATA TAT 432 
Leu Phe Ala Leu Phe Ser Lys Leu Glu Arg Thr Cys Glu Leu lie Tyr 
130 135 140 

TTG ACA CAA CCC AGC AGT TCG ATA TCT ACT GAA ATA AAT TCT GCA TTG 480 
Leu Thr Gin Pro Ser Ser Ser lie Ser Thr Glu lie Asn Ser Ala Leu 
145 150 155 

GTG CTA AAA GTT TCT TGG ATC ACA TTT TTA TTA GCT AAA GGG GAA GTA 528 
Val Leu Lys Val Ser Trp He Thr Phe Leu Leu Ala Lys Gly Glu Val 
160 165 170 

TTA CAA ATG GAA GAT GAT CTG GTG ATT TCA TTT CAG TTA ATG CTA TGT 576 
Leu Gin Met Glu Asp Asp Leu Val He Ser Phe Gin Leu Met Leu Cys 
175 180 185 190 

GTC CTT GAC TAT TTT ATT AAA CTC TCA CCT CCC ATG TTG CTC AAA GAA 624 
Val Leu Asp Tyr Phe He Lys Leu Ser Pro Pro Met Leu Leu Lys Glu 
195 200 205 

CCA TAT AAA ACA GCT GTT ATA CCC ATT AAT GGT TCA CCT CGA ACA CCC 672 
Pro Tyr Lys Thr Ala Val He Pro He Asn Gly Ser Pro Arg Thr Pro 
210 215 220 



AGG CGA GGT CAG AAC AGG AGT GCA CGG ATA GCA AAA CAA CTA GAA AAT 72 0 

Arg Arg Gly Gin Asn Arg Ser Ala Arg lie Ala Lys Gin Leu Glu Asn 
225 230 235 

GAT ACA AGA ATT ATT GAA GTT CTC TGT AAA GAA CAT GAA TGT AAT ATA 768 
Asp Thr Arg He He Glu Val Leu Cys Lys Glu His Glu Cys Asn He 
240 245 250 

GAT GAG GTG AAA AAT GTT TAT TTC AAA AAT TTT ATA CCT TTT ATG AAT 816 
Asp Glu Val Lys Asn Val Tyr Phe Lys Asn Phe He Pro Phe Met Asn 
255 260 265 270 

TCT CTT GGA CTT GTA ACA TCT AAT GGA CTT CCA GAG GTT GAA AAT CTT 864 
Ser Leu Gly Leu Val Thr Ser Asn Gly Leu Pro Glu Val Glu Asn Leu 
275 280 285 

TCT AAA CGA TAC GAA GAA ATT TAT CTT AAA AAT AAA GAT CTA GAT GCA 912 
Ser Lys Arg Tyr Glu Glu He Tyr Leu Lys Asn Lys Asp Leu Asp Ala 
290 295 300 

AGA TTA TTT TTG GAT CAT GAT AAA ACT CTT CAG ACT GAT TCT ATA GAC 960 
Arg Leu Phe Leu Asp His Asp Lys Thr Leu Gin Thr Asp Ser He Asp 
305 310 315 

AGT TTT GAA ACA CAG AGA ACA CCA CGA AAA AGT AAC CTT GAT GAA GAG 1008 
Ser Phe Glu Thr Gin Arg Thr Pro Arg Lys Ser Asn Leu Asp Glu Glu 
320 325 330 

GTG AAT GTA ATT CCT CCA CAC ACT CCA GTT AGG ACT GTT ATG AAC ACT 1056 
Val Asn Val He Pro Pro His Thr Pro Val Arg Thr Val Met Asn Thr 
335 340 345 350 

ATC CAA CAA TTA ATG ATG ATT TTA AAT TCA GCA AGT GAT CAA CCT TCA 1104 
He Gin Gin Leu Met Met He Leu Asn Ser Ala Ser Asp Gin Pro Ser 
355 360 365 

GAA AAT CTG ATT TCC TAT TTT AAC AAC TGC ACA GTG AAT CCA AAA GAA 1152 
Glu Asn Leu He Ser Tyr Phe Asn Asn Cys Thr Val Asn Pro Lys Glu 
370 375 380 

AGT ATA CTG AAA AGA GTG AAG GAT ATA GGA TAC ATC TTT AAA GAG AAA 1200 
Ser He Leu Lys Arg Val Lys Asp He Gly Tyr He Phe Lys Glu Lys 
385 390 395 

TTT GCT AAA GCT GTG GGA CAG GGT TGT GTC GAA ATT GGA TCA CAG CGA 1248 
Phe Ala Lys Ala Val Gly Gin Gly Cys Val Glu He Gly Ser Gin Arg 
400 405 410 

TAC AAA CTT GGA GTT CGC TTG TAT TAC CGA GTA ATG GAA TCC ATG CTT 1296 
Tyr Lys Leu Gly Val Arg Leu Tyr Tyr Arg Val Met Glu Ser Met Leu 
415 420 425 430 



AAA TCA GAA GAA GAA CGA TTA TCC ATT CAA AAT TTT AGC AAA CTT CTG 1344 
Lys Ser Glu Glu Glu Arg Leu Ser He Gin Asn Phe Ser Lys Leu Leu 
435 440 445 

AAT GAC AAC ATT TTT CAT ATG TCT TTA TTG GCG TGC GCT CTT GAG GTT 13 92 

Asn Asp Asn He Phe His Met Ser Leu Leu Ala Cys Ala Leu Glu Val 
450 455 460 

GTA ATG GCC ACA TAT AGC AGA AGT ACA TCT CAG AAT CTT GAT TCT GGA 1440 
Val Met Ala Thr Tyr Ser Arg Ser Thr Ser Gin Asn Leu Asp Ser Gly 
465 470 475 

ACA GAT TTG TCT TTC CCA TGG ATT CTG AAT GTG CTT AAT TTA AAA GCC 1488 
Thr Asp Leu Ser Phe Pro Trp He Leu Asn Val Leu Asn Leu Lys Ala 
480 485 490 

TTT GAT TTT TAC AAA GTG ATC GAA AGT TTT ATC AAA GCA GAA GGC AAC 1536 
Phe Asp Phe Tyr Lys Val He Glu Ser Phe He Lys Ala Glu Gly Asn 
495 500 505 510 

TTG ACA AGA GAA ATG ATA AAA CAT TTA GAA CGA TGT GAA CAT CGA ATC 1584 
Leu Thr Arg Glu Met He Lys His Leu Glu Arg Cys Glu His Arg He 
515 520 525 

ATG GAA TCC CTT GCA TGG CTC TCA GAT TCA CCT TTA TTT GAT CTT ATT 1632 
Met Glu Ser Leu Ala Trp Leu Ser Asp Ser Pro Leu Phe Asp Leu He 
530 535 540 

AAA CAA TCA AAG GAC CGA GAA GGA CCA ACT GAT CAC CTT GAA TCT GCT 1680 
Lys Gin Ser Lys Asp Arg Glu Gly Pro Thr Asp His Leu Glu Ser Ala 
545 550 555 

TGT CCT CTT AAT CTT CCT CTC CAG AAT AAT CAC ACT GCA GCA GAT ATG 1728 
Cys Pro Leu Asn Leu Pro Leu Gin Asn Asn His Thr Ala Ala Asp Met 
560 565 570 

TAT CTT TCT CCT GTA AGA TCT CCA AAG AAA AAA GGT TCA ACT ACG CGT 1776 
Tyr Leu Ser Pro Val Arg Ser Pro Lys Lys Lys Gly Ser Thr Thr Arg 
575 580 585 590 

GTA AAT TCT ACT GCA AAT GCA GAG ACA CAA GCA ACC TCA GCC TTC CAG 1824 
Val Asn Ser Thr Ala Asn Ala Glu Thr Gin Ala Thr Ser Ala Phe Gin 
595 600 605 

ACC CAG AAG CCA TTG AAA TCT ACC TCT CTT TCA CTG TTT TAT AAA AAA 1872 
Thr Gin Lys Pro Leu Lys Ser Thr Ser Leu Ser Leu Phe Tyr Lys Lys 
610 615 620 

GTG TAT CGG CTA GCC TAT CTC CGG CTA AAT ACA CTT TGT GAA CGC CTT 1920 
Val Tyr Arg Leu Ala Tyr Leu Arg Leu Asn Thr Leu Cys Glu Arg Leu 
625 630 635 



CTG TCT GAG CAC CCA GAA TTA GAA CAT ATC ATC TGG ACC CTT TTC CAG 1968 
Leu Ser Glu His Pro Glu Leu Glu His lie lie Trp Thr Leu Phe Gin 
640 645 650 

CAC ACC CTG CAG AAT GAG TAT GAA CTC ATG AGA GAC AGG CAT TTG GAC 2016 
His Thr Leu Gin Asn Glu Tyr Glu Leu Met Arg Asp Arg His Leu Asp 
655 660 665 670 

CAA ATT ATG ATG TGT TCC ATG TAT GGC ATA TGC AAA GTG AAG AAT ATA 2064 
Gin lie Met Met Cys Ser Met Tyr Gly lie Cys Lys Val Lys Asn lie 
675 680 685 

GAC CTT AAA TTC AAA ATC ATT GTA ACA GCA TAC AAG GAT CTT CCT CAT 2112 
Asp Leu Lys Phe Lys lie lie Val Thr Ala Tyr Lys Asp Leu Pro His 
690 695 700 

GCT GTT CAG GAG ACA TTC AAA CGT GTT TTG ATC AAA GAA GAG GAG TAT 2160 
Ala Val Gin Glu Thr Phe Lys Arg Val Leu lie Lys Glu Glu Glu Tyr 
705 710 715 

GAT TCT ATT ATA GTA TTC TAT AAC TCG GTC TTC ATG CAG AGA CTG AAA 2208 
Asp Ser lie lie Val Phe Tyr Asn Ser Val Phe Met Gin Arg Leu Lys 
720 725 730 

ACA AAT ATT TTG CAG TAT GCT TCC ACC AGG CCC CCT ACC TTG TCA CCA 2256 
Thr Asn lie Leu Gin Tyr Ala Ser Thr Arg Pro Pro Thr Leu Ser Pro 
735 740 745 750 

ATA CCT CAC ATT CCT CGA AGC CCT TAC AAG TTT CCT AGT TCA CCC TTA 23 04 

lie Pro His lie Pro Arg Ser Pro Tyr Lys Phe Pro Ser Ser Pro Leu 
755 760 765 

CGG ATT CCT GGA GGG AAC ATC TAT ATT TCA CCC CTG AAG AGT CCA TAT 2352 
Arg lie Pro Gly Gly Asn lie Tyr lie Ser Pro Leu Lys Ser Pro Tyr 
770 775 780 

AAA ATT TCA GAA GGT CTG CCA ACA CCA ACA AAA ATG ACT CCA AGA TCA 24 00 

Lys lie Ser Glu Gly Leu Pro Thr Pro Thr Lys Met Thr Pro Arg Ser 
785 790 795 

AGA ATC TTA GTA TCA ATT GGT GAA TCA TTC GGG ACT TCT GAG AAG TTC 2448 
Arg lie Leu Val Ser lie Gly Glu Ser Phe Gly Thr Ser Glu Lys Phe 
800 805 810 

CAG AAA ATA AAT CAG ATG GTA TGT AAC AGC GAC CGT GTG CTC AAA AGA 24 96 

Gin Lys lie Asn Gin Met Val Cys Asn Ser Asp Arg Val Leu Lys Arg 
815 820 825 830 

AGT GCT GAA GGA AGC AAC CCT CCT AAA CCA CTG AAA AAA CTA CGC TTT 2544 
Ser Ala Glu Gly Ser Asn Pro Pro Lys Pro Leu Lys Lys Leu Arg Phe 
835 840 845 



GAT ATT GAA GGA TCA GAT GAA GCA GAT GGA AGT AAA CAT CTC CCA GGA 
Asp lie Glu Gly Ser Asp Glu Ala Asp Gly Ser Lys His Leu Pro Gly 
850 855 860 



2592 



GAG TCC AAA TTT CAG CAG AAA CTG GCA GAA ATG ACT TCT ACT CGA ACA 2640 
Glu Ser Lys Phe Gin Gin Lys Leu Ala Glu Met Thr Ser Thr Arg Thr 
865 870 875 

CGA ATG CAA AAG CAG AAA ATG AAT GAT AGC ATG GAT ACC TCA AAC AAG 26 88 
Arg Met Gin Lys Gin Lys Met Asn Asp Ser Met Asp Thr Ser Asn Lys 
880 885 890 

GAA GAG AAA TGAGGATCTC AGGACCTTGG TGGACACTGT GTACACCTCT 2737 

Glu Glu Lys 

895 

GGATTCATTG TCTCTCACAG ATGTGACTGT ATAACTTTCC CAGGTTCTGT TTATGGCCAC 2797 

ATTTAATATC TTCAGCTCTT TTTGTGGATA TAAAATGTGC AGATGCAATT GTTTGGGTGA 2 857 

TTCCTAAGCC ACTTGAAATG TTAGTCATTG TTATTTATAC AAGATTGAAA ATCTTGTGTA 2917 

AATCCTGCCA TTTAAAAAGT TGTAGCAGAT TGTTTCCTCT TCCAAAGTAA AATTGCTGTG 2977 

CTTTATGGAT AGTAAGAATG GCCCTAGAGT GGGAGTCCTG ATAACCCAGG CCTGTCTGAC 3 037 

TACTTTGCCT TCTTTTGTAG CATATAGGTG ATGTTTGCTC TTGTTTTTAT TAATTTATAT 3 0 97 

GTATATTTTT TTAATTTAAC ATGAACACCC TTAGAAAATG TGTCCTATCT ATCTTCCAAA 3157 

TGCAATTTGA TTGACTGCCC ATTCACCAAA ATTATCCTGA ACTCTTCTGC AAAAATGGAT 3217 

ATTATTAGAA ATTAGAAAAA AATTACTAAT TTTACACATT AGATTTTATT TTACTATTGG 32 77 

AATCTGATAT ACTGTGTGCT TGTTTTATAA AATTTTGCTT TTAATTAAAT AAAAGCTGGA 333 7 

AGCAAAGTAT AACCATATGA TACTATCATA CTACTGAAAC AGATTTCATA C CTC AGAATG 3397 

TAAAAGAACT TACTGATTAT TTTCTTCATC CAACTTATGT TTTTAAATGA GGATTATTGA 3457 

TAGT 3461 

(2) INFORMATION FOR SEQ ID N0:41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 897 amino acids 
<B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 



Met Pro Pro Lys Thr Pro Arg Lys Thr Ala Ala Thr Ala Ala Ala Ala 
15 10 15 

Ala Ala Glu Pro Pro Ala Pro Pro Pro Pro Pro Pro Pro Glu Glu Asp 
20 25 30 

Pro Glu Gin Asp Ser Gly Pro Glu Asp Leu Pro Leu Val Arg Leu Glu 
35 40 45 

Phe Glu Glu Thr Glu Glu Pro Asp Phe Thr Ala Leu Cys Gin Lys Leu 
50 55 60 

Lys lie Pro Asp His Val Arg Glu Arg Ala Trp Leu Val Asp Leu Asp 
65 70 75 80 

Glu Met Ser Phe Thr Phe Thr Glu Leu Gin Lys Asn lie Glu lie Ser 
85 90 95 

Val His Lys Phe Phe Asn Leu Leu Lys Glu lie Asp Thr Ser Thr Lys 
100 105 110 

Val Asp Asn Ala Met Ser Arg Leu Leu Lys Lys Tyr Asp Val Leu Phe 
115 120 125 

Ala Leu Phe Ser Lys Leu Glu Arg Thr Cys Glu Leu lie Tyr Leu Thr 
130 135 140 

Gin Pro Ser Ser Ser lie Ser Thr Glu lie Asn Ser Ala Leu Val Leu 
145 150 155 160 

Lys Val Ser Trp lie Thr Phe Leu Leu Ala Lys Gly Glu Val Leu Gin 
165 170 175 

Met Glu Asp Asp Leu Val lie Ser Phe Gin Leu Met Leu Cys Val Leu 
180 185 190 

Asp Tyr Phe lie Lys Leu Ser Pro Pro Met Leu Leu Lys Glu Pro Tyr 
195 200 205 

Lys Thr Ala Val lie Pro lie Asn Gly Ser Pro Arg Thr Pro Arg Arg 
210 215 220 

Gly Gin Asn Arg Ser Ala Arg lie Ala Lys Gin Leu Glu Asn Asp Thr 
225 230 235 240 

Arg lie lie Glu Val Leu Cys Lys Glu His Glu Cys Asn lie Asp Glu 
245 250 255 



Val Lys Asn Val Tyr Phe Lys Asn Phe lie Pro Phe Met Asn Ser Leu 
260 265 270 



Gly Leu Val Thr Ser Asn Gly Leu Pro Glu Val Glu Asn Leu Ser Lys 
275 280 285 



Arg Tyr Glu Glu lie 
290 

Phe Leu Asp His Asp 
305 

Glu Thr Gin Arg Thr 
325 

Val lie Pro Pro His 
340 

Gin Leu Met Met He 
355 

Leu He Ser Tyr Phe 
370 

Leu Lys Arg Val Lys 
385 

Lys Ala Val Gly Gin 
405 

Leu Gly Val Arg Leu 
420 

Glu Glu Glu Arg Leu 
435 

Asn He Phe His Met 
450 

Ala Thr Tyr Ser Arg 
465 

Leu Ser Phe Pro Trp 
485 

Phe Tyr Lys Val He 
500 

Arg Glu Met He Lys 
515 

Ser Leu Ala Trp Leu 
530 



Tyr Leu Lys Asn Lys Asp 
295 

Lys Thr Leu Gin Thr Asp 

310 315 

Pro Arg Lys Ser Asn Leu 
330 

Thr Pro Val Arg Thr Val 
345 

Leu Asn Ser Ala Ser Asp 
360 

Asn Asn Cys Thr Val Asn 
375 

Asp He Gly Tyr He Phe 
390 395 

Gly Cys Val Glu He Gly 
410 

Tyr Tyr Arg Val Met Glu 
425 

Ser He Gin Asn Phe Ser 
440 

Ser Leu Leu Ala Cys Ala 
455 

Ser Thr Ser Gin Asn Leu 
470 475 

He Leu Asn Val Leu Asn 
490 

Glu Ser Phe He Lys Ala 
505 

His Leu Glu Arg Cys Glu 
520 

Ser Asp Ser Pro Leu Phe 
535 



Leu Asp Ala Arg Leu 
300 

Ser He Asp Ser Phe 
320 

Asp Glu Glu Val Asn 
335 

Met Asn Thr He Gin 
350 

Gin Pro Ser Glu Asn 
365 

Pro Lys Glu Ser He 
380 

Lys Glu Lys Phe Ala 
400 

Ser Gin Arg Tyr Lys 
415 

Ser Met Leu Lys Ser 
430 

Lys Leu Leu Asn Asp 
445 

Leu Glu Val Val Met 
460 

Asp Ser Gly Thr Asp 
480 

Leu Lys Ala Phe Asp 
495 

Glu Gly Asn Leu Thr 
510 

His Arg He Met Glu 
525 

Asp Leu He Lys Gin 
540 



Ser Lys Asp Arg Glu Gly Pro Thr Asp His Leu Glu Ser Ala Cys Pro 
545 550 555 560 



Leu Asn Leu Pro Leu Gin Asn Asn His Thr Ala Ala Asp Met Tyr Leu 
565 570 575 



Ser Pro Val Arg Ser Pro Lys Lys Lys Gly Ser Thr Thr Arg Val Asn 
580 585 590 

Ser Thr Ala Asn Ala Glu Thr Gin Ala Thr Ser Ala Phe Gin Thr Gin 
595 600 605 

Lys Pro Leu Lys Ser Thr Ser Leu Ser Leu Phe Tyr Lys Lys Val Tyr 
610 615 620 

Arg Leu Ala Tyr Leu Arg Leu Asn Thr Leu Cys Glu Arg Leu Leu Ser 
625 630 635 640 

Glu His Pro Glu Leu Glu His lie He Trp Thr Leu Phe Gin His Thr 
645 650 655 

Leu Gin Asn Glu Tyr Glu Leu Met Arg Asp Arg His Leu Asp Gin He 
660 665 670 

Met Met Cys Ser Met Tyr Gly He Cys Lys Val Lys Asn He Asp Leu 
675 680 685 

Lys Phe Lys He He Val Thr Ala Tyr Lys Asp Leu Pro His Ala Val 
690 695 700 

Gin Glu Thr Phe Lys Arg Val Leu He Lys Glu Glu Glu Tyr Asp Ser 
705 710 715 720 

He He Val Phe Tyr Asn Ser Val Phe Met Gin Arg Leu Lys Thr Asn 
725 730 735 

He Leu Gin Tyr Ala Ser Thr Arg Pro Pro Thr Leu Ser Pro He Pro 
740 745 750 

His He Pro Arg Ser Pro Tyr Lys Phe Pro Ser Ser Pro Leu Arg He 
755 760 765 

Pro Gly Gly Asn He Tyr He Ser Pro Leu Lys Ser Pro Tyr Lys He 

770 775 780 

Ser Glu Gly Leu Pro Thr Pro Thr Lys Met Thr Pro Arg Ser Arg He 
785 790 795 800 

Leu Val Ser He Gly Glu Ser Phe Gly Thr Ser Glu Lys Phe Gin Lys 
805 810 815 

He Asn Gin Met Val Cys Asn Ser Asp Arg Val Leu Lys Arg Ser Ala 
820 825 830 



Glu Gly Ser Asn Pro Pro Lys Pro Leu Lys Lys Leu Arg Phe Asp He 
835 840 845 



Glu Gly Ser Asp Glu Ala Asp Gly Ser Lys His Leu Pro Gly Glu Ser 
850 855 860 



Lys Phe Gin Gin 
865 

Gin Lys Gin Lys 
Lys 



Lys Leu Ala Glu 
870 

Met Asn Asp Ser 
885 



Met Thr Ser Thr 
875 

Met Asp Thr Ser 
890 



Arg Thr Arg Met 
880 

Asn Lys Glu Glu 
895 



(2) INFORMATION FOR SEQ ID NO:42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3347 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 7.. 2583 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO:42: 

CGCGTC ATG CCG CCC AAA ACC CCC CGA AAA ACG GCC GCC ACC GCC GCC 48 
Met Pro Pro Lys Thr Pro Arg Lys Thr Ala Ala Thr Ala Ala 
15 10 

GCT GCC GCC GCG GAA CCC CCG GCA CCG CCG CCG CCG CCC CCT CCT GAG 96 
Ala Ala Ala Ala Glu Pro Pro Ala Pro Pro Pro Pro Pro Pro Pro Glu 
15 20 25 30 

GAG GAC CCA GAG CAG GAC AGC GGC CCG GAG GAC CTG CCT CTC GTC AGG 144 
Glu Asp Pro Glu Gin Asp Ser Gly Pro Glu Asp Leu Pro Leu Val Arg 
35 40 45 

CTT GAG TTT GAA GAA ACA GAA GAA CCT GAT TTT ACT GCA TTA TGT CAG 192 
Leu Glu Phe Glu Glu Thr Glu Glu Pro Asp Phe Thr Ala Leu Cys Gin 
50 55 60 

AAA TTA AAG ATA CCA GAT CAT GTC AGA GAG AGA GCT TGG TTA ACT TGG 240 
Lys Leu Lys lie Pro Asp His Val Arg Glu Arg Ala Trp Leu Thr Trp 
65 70 75 



GAG AAA GTT TCA TCT GTG GAT GGA GTA TTG GGA GGT TAT ATT CAA AAG 
Glu Lys Val Ser Ser Val Asp Gly Val Leu Gly Gly Tyr lie Gin Lys 
80 85 90 



288 



AAA AAG GAA CTG TGG GGA ATC TGT ATC TTT ATT GCA GCA GTT GAC CTA 336 
Lys Lys Glu Leu Trp Gly He Cys He Phe He Ala Ala Val Asp Leu 
95 100 105 110 

GTC GAA TCT ACT GAA ATA AAT TCT GCA TTG GTG CTA AAA GTT TCT TGG 384 
Val Glu Ser Thr Glu He Asn Ser Ala Leu Val Leu Lys Val Ser Trp 
115 120 125 

ATC ACA TTT TTA TTA GCT AAA GGG GAA GTA TTA CAA ATG GAA GAT GAT 432 
He Thr Phe Leu Leu Ala Lys Gly Glu Val Leu Gin Met Glu Asp Asp 
130 135 140 

CTG GTG ATT TCA TTT CAG TTA ATG CTA TGT GTC CTT GAC TAT TTT ATT 480 
Leu Val He Ser Phe Gin Leu Met Leu Cys Val Leu Asp Tyr Phe He 
145 150 155 

AAA CTC TCA CCT CCC ATG TTG CTC AAA GAA CCA TAT AAA ACA GCT GTT 528 
Lys Leu Ser Pro Pro Met Leu Leu Lys Glu Pro Tyr Lys Thr Ala Val 
160 165 170 

ATA CCC ATT AAT GGT TCA CCT CGA ACA CCC AGG CGA GGT CAG AAC AGG 576 
He Pro He Asn Gly Ser Pro Arg Thr Pro Arg Arg Gly Gin Asn Arg 
175 180 185 190 

AGT GCA CGG ATA GCA AAA CAA CTA GAA AAT GAT ACA AGA ATT ATT GAA 624 
Ser Ala Arg He Ala Lys Gin Leu Glu Asn Asp Thr Arg He He Glu 
195 200 205 

GTT CTC TGT AAA GAA CAT GAA TGT AAT ATA GAT GAG GTG AAA AAT GTT 672 
Val Leu Cys Lys Glu His Glu Cys Asn He Asp Glu Val Lys Asn Val 
210 215 220 

TAT TTC AAA AAT TTT ATA CCT TTT ATG AAT TCT CTT GGA CTT GTA ACA 72 0 

Tyr Phe Lys Asn Phe He Pro Phe Met Asn Ser Leu Gly Leu Val Thr 
225 230 235 

TCT AAT GGA CTT CCA GAG GTT GAA AAT CTT TCT AAA CGA TAC GAA GAA 768 
Ser Asn Gly Leu Pro Glu Val Glu Asn Leu Ser Lys Arg Tyr Glu Glu 
240 245 250 

ATT TAT CTT AAA AAT AAA GAT CTA GAT GCA AGA TTA TTT TTG GAT CAT 816 
He Tyr Leu Lys Asn Lys Asp Leu Asp Ala Arg Leu Phe Leu Asp His 
255 260 265 270 

GAT AAA ACT CTT CAG ACT GAT TCT ATA GAC AGT TTT GAA ACA CAG AGA 864 
Asp Lys Thr Leu Gin Thr Asp Ser He Asp Ser Phe Glu Thr Gin Arg 
275 280 285 

ACA CCA CGA AAA AGT AAC CTT GAT GAA GAG GTG AAT GTA ATT CCT CCA 912 
Thr Pro Arg Lys Ser Asn Leu Asp Glu Glu Val Asn Val He Pro Pro 
290 295 300 



CAC ACT CCA GTT AGG ACT GTT ATG AAC ACT ATC CAA CAA TTA ATG ATG 960 
His Thr Pro Val Arg Thr Val Met Asn Thr He Gin Gin Leu Met Met 
305 310 315 

ATT TTA AAT TCA GCA AGT GAT CAA CCT TCA GAA AAT CTG ATT TCC TAT 1008 
He Leu Asn Ser Ala Ser Asp Gin Pro Ser Glu Asn Leu He Ser Tyr 
320 325 330 

TTT AAC AAC TGC ACA GTG AAT CCA AAA GAA AGT ATA CTG AAA AGA GTG 1056 
Phe Asn Asn Cys Thr Val Asn Pro Lys Glu Ser He Leu Lys Arg Val 
335 340 345 350 

AAG GAT ATA GGA TAC ATC TTT AAA GAG AAA TTT GCT AAA GCT GTG GGA 1104 
Lys Asp He Gly Tyr He Phe Lys Glu Lys Phe Ala Lys Ala Val Gly 
355 360 365 

CAG GGT TGT GTC GAA ATT GGA TCA CAG CGA TAC AAA CTT GGA GTT CGC 1152 
Gin Gly Cys Val Glu He Gly Ser Gin Arg Tyr Lys Leu Gly Val Arg 
370 375 380 

TTG TAT TAC CGA GTA ATG GAA TCC ATG CTT AAA TCA GAA GAA GAA CGA 12 00 

Leu Tyr Tyr Arg Val Met Glu Ser Met Leu Lys Ser Glu Glu Glu Arg 
385 390 395 

TTA TCC ATT CAA AAT TTT AGC AAA CTT CTG AAT GAC AAC ATT TTT CAT 1248 
Leu Ser He Gin Asn Phe Ser Lys Leu Leu Asn Asp Asn He Phe His 
400 405 410 

ATG TCT TTA TTG GCG TGC GCT CTT GAG GTT GTA ATG GCC ACA TAT AGC 1296 
Met Ser Leu Leu Ala Cys Ala Leu Glu Val Val Met Ala Thr Tyr Ser 
415 420 425 430 

AGA AGT ACA TCT CAG AAT CTT GAT TCT GGA ACA GAT TTG TCT TTC CCA 1344 
Arg Ser Thr Ser Gin Asn Leu Asp Ser Gly Thr Asp Leu Ser Phe Pro 
435 440 445 

TGG ATT CTG AAT GTG CTT AAT TTA AAA GCC TTT GAT TTT TAC AAA GTG 1392 
Trp He Leu Asn Val Leu Asn Leu Lys Ala Phe Asp Phe Tyr Lys Val 
450 455 460 

ATC GAA AGT TTT ATC AAA GCA GAA GGC AAC TTG ACA AGA GAA ATG ATA 144 0 

He Glu Ser Phe He Lys Ala Glu Gly Asn Leu Thr Arg Glu Met He 
465 470 475 

AAA CAT TTA GAA CGA TGT GAA CAT CGA ATC ATG GAA TCC CTT GCA TGG 1488 
Lys His Leu Glu Arg Cys Glu His Arg He Met Glu Ser Leu Ala Trp 
480 485 490 

CTC TCA GAT TCA CCT TTA TTT GAT CTT ATT AAA CAA TCA AAG GAC CGA 1536 
Leu Ser Asp Ser Pro Leu Phe Asp Leu He Lys Gin Ser Lys Asp Arg 
495 500 505 510 



GAA GGA CCA ACT GAT CAC CTT GAA TCT GCT TGT CCT CTT AAT CTT CCT 1584 
Glu Gly Pro Thr Asp His Leu Glu Ser Ala Cys Pro Leu Asn Leu Pro 
515 520 525 



CTC CAG AAT AAT CAC ACT GCA GCA GAT ATG TAT CTT TCT CCT GTA AGA 
Leu Gin Asn Asn His Thr Ala Ala Asp Met Tyr Leu Ser Pro Val Arg 
530 535 540 

TCT CCA AAG AAA AAA GGT TCA ACT ACG CGT GTA AAT TCT ACT GCA AAT 
Ser Pro Lys Lys Lys Gly Ser Thr Thr Arg Val Asn Ser Thr Ala Asn 
545 550 555 

GCA GAG ACA CAA GCA ACC TCA GCC TTC CAG ACC CAG AAG CCA TTG AAA 
Ala Glu Thr Gin Ala Thr Ser Ala Phe Gin Thr Gin Lys Pro Leu Lys 
560 565 570 

TCT ACC TCT CTT TCA CTG TTT TAT AAA AAA GTG TAT CGG CTA GCC TAT 
Ser Thr Ser Leu Ser Leu Phe Tyr Lys Lys Val Tyr Arg Leu Ala Tyr 
575 580 585 590 

CTC CGG CTA AAT ACA CTT TGT GAA CGC CTT CTG TCT GAG CAC CCA GAA 
Leu Arg Leu Asn Thr Leu Cys Glu Arg Leu Leu Ser Glu His Pro Glu 
595 600 605 

TTA GAA CAT ATC ATC TGG ACC CTT TTC CAG CAC ACC CTG CAG AAT GAG 
Leu Glu His He He Trp Thr Leu Phe Gin His Thr Leu Gin Asn Glu 
610 615 620 

TAT GAA CTC ATG AGA GAC AGG CAT TTG GAC CAA ATT ATG ATG TGT TCC 
Tyr Glu Leu Met Arg Asp Arg His Leu Asp Gin He Met Met Cys Ser 
625 630 635 

ATG TAT GGC ATA TGC AAA GTG AAG AAT ATA GAC CTT AAA TTC AAA ATC 
Met Tyr Gly He Cys Lys Val Lys Asn He Asp Leu Lys Phe Lys He 
640 645 650 

ATT GTA ACA GCA TAC AAG GAT CTT CCT CAT GCT GTT CAG GAG ACA TTC 
He Val Thr Ala Tyr Lys Asp Leu Pro His Ala Val Gin Glu Thr Phe 
655 660 665 670 

AAA CGT GTT TTG ATC AAA GAA GAG GAG TAT GAT TCT ATT ATA GTA TTC 
Lys Arg Val Leu He Lys Glu Glu Glu Tyr Asp Ser He He Val Phe 
675 680 685 

TAT AAC TCG GTC TTC ATG CAG AGA CTG AAA ACA AAT ATT TTG CAG TAT 
Tyr Asn Ser Val Phe Met Gin Arg Leu Lys Thr Asn He Leu Gin Tyr 
690 695 700 

GCT TCC ACC AGG CCC CCT ACC TTG TCA CCA ATA CCT CAC ATT CCT CGA 
Ala Ser Thr Arg Pro Pro Thr Leu Ser Pro He Pro His He Pro Arg 
705 710 715 



1632 



1680 



1728 



1776 



1824 



1872 



1920 



1968 



2016 



2064 



2112 



2160 



AGC CCT TAC AAG TTT CCT AGT TCA CCC TTA CGG ATT CCT GGA GGG AAC 
Ser Pro Tyr Lys Phe Pro Ser Ser Pro Leu Arg lie Pro Gly Gly Asn 
720 725 730 



2208 



ATC TAT ATT TCA CCC CTG AAG AGT CCA TAT AAA ATT TCA GAA GGT CTG 
He Tyr He Ser Pro Leu Lys Ser Pro Tyr Lys He Ser Glu Gly Leu 
735 740 745 750 

CCA ACA CCA ACA AAA ATG ACT CCA AGA TCA AGA ATC TTA GTA TCA ATT 
Pro Thr Pro Thr Lys Met Thr Pro Arg Ser Arg He Leu Val Ser He 
755 760 765 



GTA TGT AAC AGC GAC CGT GTG CTC AAA AGA AGT GCT GAA GGA AGC AAC 
Val Cys Asn Ser Asp Arg Val Leu Lys Arg Ser Ala Glu Gly Ser Asn 
785 790 795 



AAA CTG GCA GAA ATG ACT TCT ACT CGA ACA CGA ATG CAA AAG CAG AAA 
Lys Leu Ala Glu Met Thr Ser Thr Arg Thr Arg Met Gin Lys Gin Lys 
835 840 845 

ATG AAT GAT AGC ATG GAT ACC TCA AAC AAG GAA GAG AAA TGAGGATCTC 
Met Asn Asp Ser Met Asp Thr Ser Asn Lys Glu Glu Lys 
850 855 



2256 



2304 



GGT GAA TCA TTC GGG ACT TCT GAG AAG TTC CAG AAA ATA AAT CAG ATG 2352 
Gly Glu Ser Phe Gly Thr Ser Glu Lys Phe Gin Lys He Asn Gin Met 
770 775 780 



2400 



CCT CCT AAA CCA CTG AAA AAA CTA CGC TTT GAT ATT GAA GGA TCA GAT 2448 
Pro Pro Lys Pro Leu Lys Lys Leu Arg Phe Asp He Glu Gly Ser Asp 
800 805 810 

GAA GCA GAT GGA AGT AAA CAT CTC CCA GGA GAG TCC AAA TTT CAG CAG 2496 
Glu Ala Asp Gly Ser Lys His Leu Pro Gly Glu Ser Lys Phe Gin Gin 
815 820 825 830 



2544 



2593 



AGGACCTTGG TGGACACTGT GTACACCTCT GGATTCATTG TCTCTCACAG ATGTGACTGT 2653 
ATAACTTTCC CAGGTTCTGT TTATGGCCAC ATTTAATATC TTCAGCTCTT TTTGTGGATA 2713 
TAAAATGTGC AGATGCAATT GTTTGGGTGA TTCCTAAGCC ACTTGAAATG TTAGTCATTG 
TTATTTATAC AAGATTGAAA ATCTTGTGTA AATCCTGCCA TTTAAAAAGT TGTAGCAGAT 
TGTTTCCTCT TCCAAAGTAA AATTGCTGTG CTTTATGGAT AGTAAGAATG GCCCTAGAGT 
GGGAGTCCTG ATAACCCAGG CCTGTCTGAC TACTTTGCCT TCTTTTGTAG CATATAGGTG 2953 
ATGTTTGCTC TTGTTTTTAT TAATTTATAT GTATATTTTT TTAATTTAAC ATGAACACCC 
TTAGAAAATG TGTCCTATCT ATCTTCCAAA TGCAATTTGA TTGACTGCCC ATTCACCAAA 
ATTATCCTGA ACTCTTCTGC AAAAATGGAT ATTATTAGAA ATTAGAAAAA AATTACTAAT 



2773 
2833 
2893 



3013 
3073 
3133 



TTTACACATT AGATTTTATT TTACTATTGG AATCTGATAT ACTGTGTGCT TGTTTTATAA 3193 

AATTTTGCTT TTAATTAAAT AAAAGCTGGA AGCAAAGTAT AAC CAT ATGA TACTATCATA 32 53 

CTAC TGAAAC AGATTTCATA CCTCAGAATG TAAAAGAACT TACTGATTAT TTTCTTCATC 3313 

CAACTTATGT TTTTAAATGA GGATTATTGA TAGT 3347 

(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 859 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43: 

Met Pro Pro Lys Thr Pro Arg Lys Thr Ala Ala Thr Ala Ala Ala Ala 
1 5 10 15 

Ala Ala Glu Pro Pro Ala Pro Pro Pro Pro Pro Pro Pro Glu Glu Asp 
20 25 30 

Pro Glu Gin Asp Ser Gly Pro Glu Asp Leu Pro Leu Val Arg Leu Glu 
35 40 45 

Phe Glu Glu Thr Glu Glu Pro Asp Phe Thr Ala Leu Cys Gin Lys Leu 
50 55 60 

Lys He Pro Asp His Val Arg Glu Arg Ala Trp Leu Thr Trp Glu Lys 
65 70 75 80 

Val Ser Ser Val Asp Gly Val Leu Gly Gly Tyr He Gin Lys Lys Lys 
85 90 95 

Glu Leu Trp Gly He Cys He Phe He Ala Ala Val Asp Leu Val Glu 
100 105 110 

Ser Thr Glu He Asn Ser Ala Leu Val Leu Lys Val Ser Trp He Thr 
115 120 125 

Phe Leu Leu Ala Lys Gly Glu Val Leu Gin Met Glu Asp Asp Leu Val 
130 135 140 

He Ser Phe Gin Leu Met Leu Cys Val Leu Asp Tyr Phe He Lys Leu 
145 150 155 160 

Ser Pro Pro Met Leu Leu Lys Glu Pro Tyr Lys Thr Ala Val He Pro 
165 170 175 



He Asn Gly Ser Pro Arg Thr Pro Arg Arg Gly Gin Asn Arg Ser Ala 
180 185 190 



Arg He Ala Lys Gin Leu Glu Asn Asp Thr Arg He He Glu Val Leu 
195 200 205 

Cys Lys Glu His Glu Cys Asn He Asp Glu Val Lys Asn Val Tyr Phe 
210 215 220 

Lys Asn Phe He Pro Phe Met Asn Ser Leu Gly Leu Val Thr Ser Asn 
225 230 235 240 

Gly Leu Pro Glu Val Glu Asn Leu Ser Lys Arg Tyr Glu Glu He Tyr 
245 250 255 

Leu Lys Asn Lys Asp Leu Asp Ala Arg Leu Phe Leu Asp His Asp Lys 
260 265 270 

Thr Leu Gin Thr Asp Ser He Asp Ser Phe Glu Thr Gin Arg Thr Pro 
275 280 285 

Arg Lys Ser Asn Leu Asp Glu Glu Val Asn Val He Pro Pro His Thr 
290 295 300 

Pro Val Arg Thr Val Met Asn Thr He Gin Gin Leu Met Met He Leu 
305 310 315 320 

Asn Ser Ala Ser Asp Gin Pro Ser Glu Asn Leu He Ser Tyr Phe Asn 
325 330 335 

Asn Cys Thr Val Asn Pro Lys Glu Ser He Leu Lys Arg Val Lys Asp 
340 345 350 

He Gly Tyr He Phe Lys Glu Lys Phe Ala Lys Ala Val Gly Gin Gly 
355 360 365 

Cys Val Glu He Gly Ser Gin Arg Tyr Lys Leu Gly Val Arg Leu Tyr 
370 375 380 

Tyr Arg Val Met Glu Ser Met Leu Lys Ser Glu Glu Glu Arg Leu Ser 
385 390 395 400 

He Gin Asn Phe Ser Lys Leu Leu Asn Asp Asn He Phe His Met Ser 
405 410 415 

Leu Leu Ala Cys Ala Leu Glu Val Val Met Ala Thr Tyr Ser Arg Ser 
420 425 430 

Thr Ser Gin Asn Leu Asp Ser Gly Thr Asp Leu Ser Phe Pro Trp He 
435 440 445 



Leu Asn Val Leu Asn Leu Lys Ala Phe Asp Phe Tyr Lys Val He Glu 
450 455 460 



Ser Phe He Lys Ala Glu Gly Asn Leu Thr Arg Glu Met He Lys His 
465 470 475 480 



Leu Glu Arg Cys Glu His Arg He Met Glu Ser Leu Ala Trp Leu Ser 
485 490 495 

Asp Ser Pro Leu Phe Asp Leu He Lys Gin Ser Lys Asp Arg Glu Gly 
500 505 510 

Pro Thr Asp His Leu Glu Ser Ala Cys Pro Leu Asn Leu Pro Leu Gin 
515 520 525 

Asn Asn His Thr Ala Ala Asp Met Tyr Leu Ser Pro Val Arg Ser Pro 
530 535 540 

Lys Lys Lys Gly Ser Thr Thr Arg Val Asn Ser Thr Ala Asn Ala Glu 
545 550 555 560 

Thr Gin Ala Thr Ser Ala Phe Gin Thr Gin Lys Pro Leu Lys Ser Thr 
565 570 575 

Ser Leu Ser Leu Phe Tyr Lys Lys Val Tyr Arg Leu Ala Tyr Leu Arg 
580 585 590 

Leu Asn Thr Leu Cys Glu Arg Leu Leu Ser Glu His Pro Glu Leu Glu 
595 600 605 

His He He Trp Thr Leu Phe Gin His Thr Leu Gin Asn Glu Tyr Glu 
610 615 620 

Leu Met Arg Asp Arg His Leu Asp Gin He Met Met Cys Ser Met Tyr 
625 630 635 640 

Gly He Cys Lys Val Lys Asn He Asp Leu Lys Phe Lys He He Val 
645 650 655 

Thr Ala Tyr Lys Asp Leu Pro His Ala Val Gin Glu Thr Phe Lys Arg 
660 665 670 

Val Leu He Lys Glu Glu Glu Tyr Asp Ser He He Val Phe Tyr Asn 
675 680 685 

Ser Val Phe Met Gin Arg Leu Lys Thr Asn He Leu Gin Tyr Ala Ser 
650 695 700 

Thr Arg Pro Pro Thr Leu Ser Pro He Pro His He Pro Arg Ser Pro 
705 710 715 720 

Tyr Lys Phe Pro Ser Ser Pro Leu Arg He Pro Gly Gly Asn He Tyr 
725 730 735 

He Ser Pro Leu Lys Ser Pro Tyr Lys He Ser Glu Gly Leu Pro Thr 
740 745 750 



Pro Thr Lys Met Thr Pro Arg Ser Arg He Leu Val Ser He Gly Glu 
755 760 765 



Ser Phe Gly Thr Ser Glu Lys Phe Gin Lys He Asn Gin Met Val Cys 
770 775 780 

Asn Ser Asp Arg Val Leu Lys Arg Ser Ala Glu Gly Ser Asn Pro Pro 
785 790 795 800 

Lys Pro Leu Lys Lys Leu Arg Phe Asp He Glu Gly Ser Asp Glu Ala 
805 810 815 

Asp Gly Ser Lys His Leu Pro Gly Glu Ser Lys Phe Gin Gin Lys Leu 
820 825 830 

Ala Glu Met Thr Ser Thr Arg Thr Arg Met Gin Lys Gin Lys Met Asn 
835 840 845 



Asp Ser Met Asp Thr Ser Asn Lys Glu Glu Lys 
850 855 



(2) INFORMATION" FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3161 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 7.. 2397 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:44: 

CGCGTC ATG CCG CCC AAA ACC CCC CGA AAA ACG GCC GCC ACC GCC GCC 48 
Met Pro Pro Lys Thr Pro Arg Lys Thr Ala Ala Thr Ala Ala 
15 10 

GCT GCC GCC GCG GAA CCC CCG GCA CCG CCG CCG CCG CCC CCT CCT GAG 96 
Ala Ala Ala Ala Glu Pro Pro Ala Pro Pro Pro Pro Pro Pro Pro Glu 
15 20 25 30 

GAG GAC CCA GAG CAG GAC AGC GGC CCG GAG GAC CTG CCT CTC GTC AGG 144 
Glu Asp Pro Glu Gin Asp Ser Gly Pro Glu Asp Leu Pro Leu Val Arg 
35 40 45 

CTT GAG TTT GAA GAA ACA GAA GAA CCT GAT TTT ACT GCA TTA TGT CAG 192 
Leu Glu Phe Glu Glu Thr Glu Glu Pro Asp Phe Thr Ala Leu Cys Gin 
50 55 60 



AAA TTA AAG ATA CCA GAT CAT GTC AGA GAG AGA GCT TGG TTA ACT TGG 240 
Lys Leu Lys He Pro Asp His Val Arg Glu Arg Ala Trp Leu Thr Trp 
65 70 75 

GAG AAA GTT TCA TCT GTG GAT GGA GTA TTG GGA GGT TAT ATT CAA AAG 288 
Glu Lys Val Ser Ser Val Asp Gly Val Leu Gly Gly Tyr He Gin Lys 
80 85 90 

AAA AAG GAA CTG TGG GGA ATC TGT ATC TTT ATT GCA GCA GTT GAC CTA 336 
Lys Lys Glu Leu Trp Gly He Cys He Phe He Ala Ala Val Asp Leu 
95 100 105 110 

GCT GTT ATA CCC ATT AAT GGT TCA CCT CGA ACA CCC AGG CGA GGT CAG 384 
Ala Val He Pro He Asn Gly Ser Pro Arg Thr Pro Arg Arg Gly Gin 
115 120 125 

AAC AGG AGT GCA CGG ATA GCA AAA CAA CTA GAA AAT GAT ACA AGA ATT 432 
Asn Arg Ser Ala Arg He Ala Lys Gin Leu Glu Asn Asp Thr Arg He 
130 135 140 

ATT GAA GTT CTC TGT AAA GAA CAT GAA TGT AAT ATA GAT GAG GTG AAA 48 0 

He Glu Val Leu Cys Lys Glu His Glu Cys Asn He Asp Glu Val Lys 
145 150 155 

AAT GTT TAT TTC AAA AAT TTT ATA CCT TTT ATG AAT TCT CTT GGA CTT 528 
Asn Val Tyr Phe Lys Asn Phe He Pro Phe Met Asn Ser Leu Gly Leu 
160 165 170 

GTA ACA TCT AAT GGA CTT CCA GAG GTT GAA AAT CTT TCT AAA CGA TAC 576 
Val Thr Ser Asn Gly Leu Pro Glu Val Glu Asn Leu Ser Lys Arg Tyr 
175 180 185 190 

GAA GAA ATT TAT CTT AAA AAT AAA GAT CTA GAT GCA AGA TTA TTT TTG 624 
Glu Glu He Tyr Leu Lys Asn Lys Asp Leu Asp Ala Arg Leu Phe Leu 
155 200 205 

GAT CAT GAT AAA ACT CTT CAG ACT GAT TCT ATA GAC AGT TTT GAA ACA 672 
Asp His Asp Lys Thr Leu Gin Thr Asp Ser He Asp Ser Phe Glu Thr 
210 215 220 

CAG AGA ACA CCA CGA AAA AGT AAC CTT GAT GAA GAG GTG AAT GTA ATT 720 
Gin Arg Thr Pro Arg Lys Ser Asn Leu Asp Glu Glu Val Asn Val He 
225 230 235 

CCT CCA CAC ACT CCA GTT AGG ACT GTT ATG AAC ACT ATC CAA CAA TTA 768 
Pro Pro His Thr Pro Val Arg Thr Val Met Asn Thr He Gin Gin Leu 
240 245 250 

ATG ATG ATT TTA AAT TCA GCA AGT GAT CAA CCT TCA GAA AAT CTG ATT 816 
Met Met He Leu Asn Ser Ala Ser Asp Gin Pro Ser Glu Asn Leu He 
255 260 265 270 



TCC TAT TTT AAC AAC TGC ACA GTG AAT CCA AAA GAA AGT ATA CTG AAA 864 
Ser Tyr Phe Asn Asn Cys Thr Val Asn Pro Lys Glu Ser lie Leu Lys 
275 280 285 

AGA GTG AAG GAT ATA GGA TAC ATC TTT AAA GAG AAA TTT GCT AAA GCT 912 
Arg Val Lys Asp He Gly Tyr He Phe Lys Glu Lys Phe Ala Lys Ala 
290 295 300 

GTG GGA CAG GGT TGT GTC GAA ATT GGA TCA CAG CGA TAC AAA CTT GGA 960 
Val Gly Gin Gly Cys Val Glu He Gly Ser Gin Arg Tyr Lys Leu Gly 
305 310 315 

GTT CGC TTG TAT TAC CGA GTA ATG GAA TCC ATG CTT AAA TCA GAA GAA 1008 
Val Arg Leu Tyr Tyr Arg Val Met Glu Ser Met Leu Lys Ser Glu Glu 
3 20 325 330 

GAA CGA TTA TCC ATT CAA AAT TTT AGC AAA CTT CTG AAT GAC AAC ATT 1056 
Glu Arg Leu Ser He Gin Asn Phe Ser Lys Leu Leu Asn Asp Asn He 
335 340 345 350 

TTT CAT ATG TCT TTA TTG GCG TGC GCT CTT GAG GTT GTA ATG GCC ACA 1104 
Phe His Met Ser Leu Leu Ala Cys Ala Leu Glu Val Val Met Ala Thr 
355 360 365 



TAT AGC AGA AGT ACA TCT CAG AAT CTT GAT TCT GGA ACA GAT TTG TCT 
Tyr Ser Arg Ser Thr Ser Gin Asn Leu Asp Ser Gly Thr Asp Leu Ser 
370 375 380 



AAA GTG ATC GAA AGT TTT ATC AAA GCA GAA GGC AAC TTG ACA AGA GAA 
Lys Val He Glu Ser Phe He Lys Ala Glu Gly Asn Leu Thr Arg Glu 
400 405 410 

ATG ATA AAA CAT TTA GAA CGA TGT GAA CAT CGA ATC ATG GAA TCC CTT 
Met He Lys His Leu Glu Arg Cys Glu His Arg He Met Glu Ser Leu 
415 420 425 430 



GAC CGA GAA GGA CCA ACT GAT CAC CTT GAA TCT GCT TGT CCT CTT AAT 
Asp Arg Glu Gly Pro Thr Asp His Leu Glu Ser Ala Cys Pro Leu Asn 
450 455 460 

CTT CCT CTC CAG AAT AAT CAC ACT GCA GCA GAT ATG TAT CTT TCT CCT 
Leu Pro Leu Gin Asn Asn His Thr Ala Ala Asp Met Tyr Leu Ser Pro 
465 470 475 



1152 



TTC CCA TGG ATT CTG AAT GTG CTT AAT TTA AAA GCC TTT GAT TTT TAC 1200 
Phe Pro Trp He Leu Asn Val Leu Asn Leu Lys Ala Phe Asp Phe Tyr 
385 390 395 



1248 



1296 



GCA TGG CTC TCA GAT TCA CCT TTA TTT GAT CTT ATT AAA CAA TCA AAG 1344 
Ala Trp Leu Ser Asp Ser Pro Leu Phe Asp Leu He Lys Gin Ser Lys 
435 440 445 



1392 



1440 



GTA AGA TCT CCA AAG AAA AAA GGT TCA ACT ACG CGT GTA AAT TCT ACT 1488 
Val Arg Ser Pro Lys Lys Lys Gly Ser Thr Thr Arg Val Asn Ser Thr 
480 485 490 

GCA AAT GCA GAG ACA CAA GCA ACC TCA GCC TTC CAG ACC CAG AAG CCA 1536 
Ala Asn Ala Glu Thr Gin Ala Thr Ser Ala Phe Gin Thr Gin Lys Pro 
495 500 505 510 

TTG AAA TCT ACC TCT CTT TCA CTG TTT TAT AAA AAA GTG TAT CGG CTA 1584 
Leu Lys Ser Thr Ser Leu Ser Leu Phe Tyr Lys Lys Val Tyr Arg Leu 
515 520 525 

GCC TAT CTC CGG CTA AAT ACA CTT TGT GAA CGC CTT CTG TCT GAG CAC 1632 
Ala Tyr Leu Arg Leu Asn Thr Leu Cys Glu Arg Leu Leu Ser Glu His 
530 535 540 

CCA GAA TTA GAA CAT ATC ATC TGG ACC CTT TTC CAG CAC ACC CTG CAG 1680 
Pro Glu Leu Glu His He He Trp Thr Leu Phe Gin His Thr Leu Gin 
545 550 555 

AAT GAG TAT GAA CTC ATG AGA GAC AGG CAT TTG GAC CAA ATT ATG ATG 1728 
Asn Glu Tyr Glu Leu Met Arg Asp Arg His Leu Asp Gin He Met Met 
560 565 570 

TGT TCC ATG TAT GGC ATA TGC AAA GTG AAG AAT ATA GAC CTT AAA TTC 1776 
Cys Ser Met Tyr Gly He Cys Lys Val Lys Asn He Asp Leu Lys Phe 
575 580 585 590 

AAA ATC ATT GTA ACA GCA TAC AAG GAT CTT CCT CAT GCT GTT CAG GAG 1824 
Lys He He Val Thr Ala Tyr Lys Asp Leu Pro His Ala Val Gin Glu 
595 600 605 

ACA TTC AAA CGT GTT TTG ATC AAA GAA GAG GAG TAT GAT TCT ATT ATA 1872 
Thr Phe Lys Arg Val Leu He Lys Glu Glu Glu Tyr Asp Ser He He 
610 615 620 

GTA TTC TAT AAC TCG GTC TTC ATG CAG AGA CTG AAA ACA AAT ATT TTG 192 0 

Val Phe Tyr Asn Ser Val Phe Met Gin Arg Leu Lys Thr Asn He Leu 
625 630 635 

CAG TAT GCT TCC ACC AGG CCC CCT ACC TTG TCA CCA ATA CCT CAC ATT 1968 
Gin Tyr Ala Ser Thr Arg Pro Pro Thr Leu Ser Pro He Pro His He 
640 645 650 

CCT CGA AGC CCT TAC AAG TTT CCT AGT TCA CCC TTA CGG ATT CCT GGA 2016 
Pro Arg Ser Pro Tyr Lys Phe Pro Ser Ser Pro Leu Arg He Pro Gly 
655 660 665 670 

GGG AAC ATC TAT ATT TCA CCC CTG AAG AGT CCA TAT AAA ATT TCA GAA 2064 
Gly Asn He Tyr He Ser Pro Leu Lys Ser Pro Tyr Lys He Ser Glu 
67 5 680 685 



GGT CTG CCA ACA CCA ACA AAA ATG ACT CCA AGA TCA AGA ATC TTA GTA 2112 
Gly Leu Pro Thr Pro Thr Lys Met Thr Pro Arg Ser Arg He Leu Val 
690 695 700 

TCA ATT GGT GAA TCA TTC GGG ACT TCT GAG AAG TTC CAG AAA ATA AAT 2160 
Ser He Gly Glu Ser Phe Gly Thr Ser Glu Lys Phe Gin Lys He Asn 
705 710 715 

CAG ATG GTA TGT AAC AGC GAC CGT GTG CTC AAA AGA AGT GCT GAA GGA 22 08 

Gin Met Val Cys Asn Ser Asp Arg Val Leu Lys Arg Ser Ala Glu Gly 
720 725 730 

AGC AAC CCT CCT AAA CCA CTG AAA AAA CTA CGC TTT GAT ATT GAA GGA 2256 
Ser Asn Pro Pro Lys Pro Leu Lys Lys Leu Arg Phe Asp He Glu Gly 
735 740 745 750 

TCA GAT GAA GCA GAT GGA AGT AAA CAT CTC CCA GGA GAG TCC AAA TTT 2304 
Ser Asp Glu Ala Asp Gly Ser Lys His Leu Pro Gly Glu Ser Lys Phe 
755 760 765 

CAG CAG AAA CTG GCA GAA ATG ACT TCT ACT CGA ACA CGA ATG CAA AAG 2352 
Gin Gin Lys Leu Ala Glu Met Thr Ser Thr Arg Thr Arg Met Gin Lys 
770 775 780 

CAG AAA ATG AAT GAT AGC ATG GAT ACC TCA AAC AAG GAA GAG AAA 2397 
Gin Lys Met Asn Asp Ser Met Asp Thr Ser Asn Lys Glu Glu Lys 
785 790 795 

TGAGGATCTC AGGACCTTGG TGGACACTGT GTACACCTCT GGATTCATTG TCTCTCACAG 2457 

ATGTGACTGT ATAACTTTCC CAGGTTCTGT TTATGGCCAC ATTTAATATC TTCAGCTCTT 2 517 

TTTGTGGATA TAAAATGTGC AGATGCAATT GTTTGGGTGA TTCCTAAGCC ACTTGAAATG 2577 

TTAGTCATTG TTATTTATAC AAGATTGAAA ATCTTGTGTA AATCCTGCCA TTTAAAAAGT 2637 

TGTAGCAGAT TGTTTCCTCT TCCAAAGTAA AATTGCTGTG CTTTATGGAT AGTAAGAATG 2697 

GCCCTAGAGT GGGAGTCCTG ATAACCCAGG CCTGTCTGAC TACTTTGCCT TCTTTTGTAG 2757 

CATATAGGTG ATGTTTGCTC TTGTTTTTAT TAATTTATAT GTATATTTTT TTAATTTAAC 2817 

ATGAACACCC TTAGAAAATG TGTCCTATCT ATCTTCCAAA TGCAATTTGA TTGACTGCCC 2877 

ATTC AC C AAA ATTATCCTGA ACTCTTCTGC AAAAATGGAT ATTATTAGAA ATTAGAAAAA 2 937 

AATTACTAAT TTTACACATT AGATTTTATT TTACTATTGG AATCTGATAT ACTGTGTGCT 2997 

TGTTTTATAA AATTTTGCTT TTAATTAAAT AAAAGCTGGA AGCAAAGTAT AACCATATGA 3057 

TACTATCATA CTACTGAAAC AGATTTCATA CCTCAGAATG TAAAAGAACT TACTGATTAT 3117 

TTTCTTCATC CAACTTATGT TTTTAAATGA GGATTATTGA TAGT 3161 



(2) INFORMATION FOR SEQ ID NO: 45: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 797 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:45: 

Met Pro Pro Lys Thr Pro Arg Lys Thr Ala Ala Thr Ala Ala Ala Ala 
15 10 15 

Ala Ala Glu Pro Pro Ala Pro Pro Pro Pro Pro Pro Pro Glu Glu Asp 
20 25 30 

Pro Glu Gin Asp Ser Gly Pro Glu Asp Leu Pro Leu Val Arg Leu Glu 
35 40 45 

Phe Glu Glu Thr Glu Glu Pro Asp Phe Thr Ala Leu Cys Gin Lys Leu 
50 55 60 

Lys lie Pro Asp His Val Arg Glu Arg Ala Trp Leu Thr Trp Glu Lys 
65 70 75 80 

Val Ser Ser Val Asp Gly Val Leu Gly Gly Tyr He Gin Lys Lys Lys 
85 90 95 

Glu Leu Trp Gly He Cys He Phe He Ala Ala Val Asp Leu Ala Val 
100 105 110 

He Pro He Asn Gly Ser Pro Arg Thr Pro Arg Arg Gly Gin Asn Arg 
115 120 125 

Ser Ala Arg He Ala Lys Gin Leu Glu Asn Asp Thr Arg He He Glu 
130 135 140 

Val Leu Cys Lys Glu His Glu Cys Asn He Asp Glu Val Lys Asn Val 
i45 150 155 160 

Tyr Phe Lys Asn Phe He Pro Phe Met Asn Ser Leu Gly Leu Val Thr 
165 170 175 

Ser Asn Gly Leu Pro Glu Val Glu Asn Leu Ser Lys Arg Tyr Glu Glu 
180 185 190 

He Tyr Leu Lys Asn Lys Asp Leu Asp Ala Arg Leu Phe Leu Asp His 
195 200 205 



Asp Lys Thr Leu Gin Thr Asp Ser He Asp Ser Phe Glu Thr Gin Arg 
210 215 220 



Thr Pro Arg Lys Ser Asn Leu Asp Glu Glu Val Asn Val lie Pro Pro 
225 230 235 240 



His Thr Pro Val Arg Thr Val Met Asn Thr He Gin Gin Leu Met Met 
245 250 255 

He Leu Asn Ser Ala Ser Asp Gin Pro Ser Glu Asn Leu He Ser Tyr 
260 265 270 

Phe Asn Asn Cys Thr Val Asn Pro Lys Glu Ser He Leu Lys Arg Val 
275 280 285 

Lys Asp He Gly Tyr He Phe Lys Glu Lys Phe Ala Lys Ala Val Gly 
290 295 300 

Gin Gly Cys Val Glu He Gly Ser Gin Arg Tyr Lys Leu Gly Val Arg 
305 310 315 320 

Leu Tyr Tyr Arg Val Met Glu Ser Met Leu Lys Ser Glu Glu Glu Arg 
325 330 335 

Leu Ser He Gin Asn Phe Ser Lys Leu Leu Asn Asp Asn He Phe His 
340 345 350 

Met Ser Leu Leu Ala Cys Ala Leu Glu Val Val Met Ala Thr Tyr Ser 
355 360 365 

Arg Ser Thr Ser Gin Asn Leu Asp Ser Gly Thr Asp Leu Ser Phe Pro 
370 375 380 

Trp He Leu Asn Val Leu Asn Leu Lys Ala Phe Asp Phe Tyr Lys Val 
385 390 395 400 

He Glu Ser Phe He Lys Ala Glu Gly Asn Leu Thr Arg Glu Met He 
405 410 415 

Lys His Leu Glu Arg Cys Glu His Arg He Met Glu Ser Leu Ala Trp 
420 425 430 

Leu Ser Asp Ser Pro Leu Phe Asp Leu He Lys Gin Ser Lys Asp Arg 
435 440 445 

Glu Gly Pro Thr Asp His Leu Glu Ser Ala Cys Pro Leu Asn Leu Pro 
450 455 460 

Leu Gin Asn Asn His Thr Ala Ala Asp Met Tyr Leu Ser Pro Val Arg 
465 470 475 480 

Ser Pro Lys Lys Lys Gly Ser Thr Thr Arg Val Asn Ser Thr Ala Asn 
485 490 495 



Ala Glu Thr Gin Ala Thr Ser Ala Phe Gin Thr Gin Lys Pro Leu Lys 
500 505 510 



Ser Thr Ser Leu Ser Leu Phe Tyr Lys Lys Val Tyr Arg Leu Ala Tyr 
515 520 525 



Leu Arg Leu Asn Thr 
530 

Leu Glu His He lie 
545 

Tyr Glu Leu Met Arg 
565 

Met Tyr Gly He Cys 
580 

He Val Thr Ala Tyr 
595 

Lys Arg Val Leu He 
610 

Tyr Asn Ser Val Phe 
625 

Ala Ser Thr Arg Pro 
645 

Ser Pro Tyr Lys Phe 
660 

He Tyr He Ser Pro 
675 

Pro Thr Pro Thr Lys 
690 

Gly Glu Ser Phe Gly 
705 

Val Cys Asn Ser Asp 
725 

Pro Pro Lys Pro Leu 
740 

Glu Ala Asp Gly Ser 
755 

Lys Leu Ala Glu Met 
770 



Leu Cys Glu Arg Leu Leu 
535 

Trp Thr Leu Phe Gin His 
550 555 

Asp Arg His Leu Asp Gin 
570 

Lys Val Lys Asn He Asp 
585 

Lys Asp Leu Pro His Ala 
600 

Lys Glu Glu Glu Tyr Asp 
615 

Met Gin Arg Leu Lys Thr 
630 635 

Pro Thr Leu Ser Pro He 
650 

Pro Ser Ser Pro Leu Arg 
665 

Leu Lys Ser Pro Tyr Lys 
680 

Met Thr Pro Arg Ser Arg 
695 

Thr Ser Glu Lys Phe Gin 
710 715 

Arg Val Leu Lys Arg Ser 
730 

Lys Lys Leu Arg Phe Asp 
745 

Lys His Leu Pro Gly Glu 
760 

Thr Ser Thr Arg Thr Arg 
775 



Ser Glu His Pro Glu 
540 

Thr Leu Gin Asn Glu 
560 

He Met Met Cys Ser 
575 

Leu Lys Phe Lys He 
590 

Val Gin Glu Thr Phe 
605 

Ser He He Val Phe 
620 

Asn He Leu Gin Tyr 
640 

Pro His He Pro Arg 
655 

He Pro Gly Gly Asn 
670 

He Ser Glu Gly Leu 
685 

He Leu Val Ser He 
700 

Lys He Asn Gin Met 
720 

Ala Glu Gly Ser Asn 
735 

He Glu Gly Ser Asp 
750 

Ser Lys Phe Gin Gin 
765 

Met Gin Lys Gin Lys 
780 



Met Asn Asp Ser Met Asp Thr Ser Asn Lys Glu Glu Lys 
785 790 795 



(2) INFORMATION FOR SEQ ID NO: 46: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 77 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 7.. 2613 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:46: 

CGCGTC ATG CCG CCC AAA ACC CCC CGA AAA ACG GCC GCC ACC GCC GCC 48 
Met Pro Pro Lys Thr Pro Arg Lys Thr Ala Ala Thr Ala Ala 
15 10 

GCT GCC GCC GCG GAA CCC CCG GCA CCG CCG CCG CCG CCC CCT CCT GAG 96 
Ala Ala Ala Ala Glu Pro Pro Ala Pro Pro Pro Pro Pro Pro Pro Glu 
15 20 25 30 

GAG GAC CCA GAG CAG GAC AGC GGC CCG GAG GAC CTG CCT CTC GTC AGG 144 
Glu Asp Pro Glu Gin Asp Ser Gly Pro Glu Asp Leu Pro Leu Val Arg 
35 40 45 

CTT GAG TTT GAA GAA ACA GAA GAA CCT GAT TTT ACT GCA TTA TGT CAG 192 
Leu Glu Phe Glu Glu Thr Glu Glu Pro Asp Phe Thr Ala Leu Cys Gin 
50 55 60 

AAA TTA AAG ATA CCA GAT CAT GTC AGA GAG AGA GCT TGG TTA ACT TGG 240 
Lys Leu Lys lie Pro Asp His Val Arg Glu Arg Ala Trp Leu Thr Trp 
65 70 75 

GAG AAA GTT TCA TCT GTG GAT GGA GTA TTG GGA GGT TAT ATT CAA AAG 2 88 

Glu Lys Val Ser Ser Val Asp Gly Val Leu Gly Gly Tyr He Gin Lys 
80 85 90 

AAA AAG GAA CTG TGG GGA ATC TGT ATC TTT ATT GCA GCA GTT GAC CTA 336 
Lys Lys Glu Leu Trp Gly He Cys He Phe He Ala Ala Val Asp Leu 
95 100 105 110 

GAT GAG ATG TCG TTC ACT TTT ACT GAG CTA CAG AAA AAC ATA GAA ATC 384 
Asp Glu Met Ser Phe Thr Phe Thr Glu Leu Gin Lys Asn He Glu He 
H5 120 125 

AGT GTC CAT AAA TTC TTT AAC TTA CTA AAA GAA ATT GAT ACC AGT ACC 432 
Ser Val His Lys Phe Phe Asn Leu Leu Lys Glu He Asp Thr Ser Thr 
130 135 140 

AAA GTT GAT AAT GCT ATG TCA AGA CTG TTG AAG AAG TAT GAT GTA TTG 4 80 

Lys Val Asp Asn Ala Met Ser Arg Leu Leu Lys Lys Tyr Asp Val Leu 
145 150 155 



TTT GCA CTC TTC AGC AAA TTG GAA AGG ACA TGT GAA CTT ATA TAT TTG 528 
Phe Ala Leu Phe Ser Lys Leu Glu Arg Thr Cys Glu Leu He Tyr Leu 
160 165 170 

ACA CAA CCC AGC AGT TCG ATG GTC GCT GTT ATA CCC ATT AAT GGT TCA 576 
Thr Gin Pro Ser Ser Ser Met Val Ala Val He Pro He Asn Gly Ser 
175 180 185 190 

CCT CGA ACA CCC AGG CGA GGT CAG AAC AGG AGT GCA CGG ATA GCA AAA 624 
Pro Arg Thr Pro Arg Arg Gly Gin Asn Arg Ser Ala Arg He Ala Lys 
195 200 205 

CAA CTA GAA AAT GAT ACA AGA ATT ATT GAA GTT CTC TGT AAA GAA CAT 672 
Gin Leu Glu Asn Asp Thr Arg He He Glu Val Leu Cys Lys Glu His 
210 215 220 

GAA TGT AAT ATA GAT GAG GTG AAA AAT GTT TAT TTC AAA AAT TTT ATA 72 0 

Glu Cys Asn He Asp Glu Val Lys Asn Val Tyr Phe Lys Asn Phe He 
225 230 235 

CCT TTT ATG AAT TCT CTT GGA CTT GTA ACA TCT AAT GGA CTT CCA GAG 768 
Pro Phe Met Asn Ser Leu Gly Leu Val Thr Ser Asn Gly Leu Pro Glu 
240 245 250 

GTT GAA AAT CTT TCT AAA CGA TAC GAA GAA ATT TAT CTT AAA AAT AAA 816 
Val Glu Asn Leu Ser Lys Arg Tyr Glu Glu He Tyr Leu Lys Asn Lys 
255 260 265 270 

GAT CTA GAT GCA AGA TTA TTT TTG GAT CAT GAT AAA ACT CTT CAG ACT 864 
Asp Leu Asp Ala Arg Leu Phe Leu Asp His Asp Lys Thr Leu Gin Thr 
275 280 285 

GAT TCT ATA GAC AGT TTT GAA ACA CAG AGA ACA CCA CGA AAA AGT AAC 912 
Asp Ser He Asp Ser Phe Glu Thr Gin Arg Thr Pro Arg Lys Ser Asn 
290 295 300 

CTT GAT GAA GAG GTG AAT GTA ATT CCT CCA CAC ACT CCA GTT AGG ACT 960 
Leu Asp Glu Glu Val Asn Val He Pro Pro His Thr Pro Val Arg Thr 
305 310 315 

GTT ATG AAC ACT ATC CAA CAA TTA ATG ATG ATT TTA AAT TCA GCA AGT 1008 
Val Met Asn Thr He Gin Gin Leu Met Met He Leu Asn Ser Ala Ser 
320 325 330 

GAT CAA CCT TCA GAA AAT CTG ATT TCC TAT TTT AAC AAC TGC ACA GTG 1056 
Asp Gin Pro Ser Glu Asn Leu He Ser Tyr Phe Asn Asn Cys Thr Val 
335 340 345 350 

AAT CCA AAA GAA AGT ATA CTG AAA AGA GTG AAG GAT ATA GGA TAC ATC 1104 
Asn Pro Lys Glu Ser He Leu Lys Arg Val Lys Asp He Gly Tyr He 
355 360 365 



TTT AAA GAG AAA TTT GCT AAA GCT GTG GGA CAG GGT TGT GTC GAA ATT 
Phe Lys Glu Lys Phe Ala Lys Ala Val Gly Gin Gly Cys Val Glu He 
370 375 380 

GGA TCA CAG CGA TAC AAA CTT GGA GTT CGC TTG TAT TAC CGA GTA ATG 
Gly Ser Gin Arg Tyr Lys Leu Gly Val Arg Leu Tyr Tyr Arg Val Met 
385 390 395 

GAA TCC ATG CTT AAA TCA GAA GAA GAA CGA TTA TCC ATT CAA AAT TTT 
Glu Ser Met Leu Lys Ser Glu Glu Glu Arg Leu Ser He Gin Asn Phe 
400 405 410 

AGC AAA CTT CTG AAT GAC AAC ATT TTT CAT ATG TCT TTA TTG GCG TGC 
Ser Lys Leu Leu Asn Asp Asn He Phe His Met Ser Leu Leu Ala Cys 
415 420 425 430 

GCT CTT GAG GTT GTA ATG GCC ACA TAT AGC AGA AGT ACA TCT CAG AAT 
Ala Leu Glu Val Val Met Ala Thr Tyr Ser Arg Ser Thr Ser Gin Asn 
435 440 445 

CTT GAT TCT GGA ACA GAT TTG TCT TTC CCA TGG ATT CTG AAT GTG CTT 
Leu Asp Ser Gly Thr Asp Leu Ser Phe Pro Trp He Leu Asn Val Leu 
450 455 460 

AAT TTA AAA GCC TTT GAT TTT TAC AAA GTG ATC GAA AGT TTT ATC AAA 
Asn Leu Lys Ala Phe Asp Phe Tyr Lys Val He Glu Ser Phe lie Lys 
465 470 475 

GCA GAA GGC AAC TTG ACA AGA GAA ATG ATA AAA CAT TTA GAA CGA TGT 
Ala Glu Gly Asn Leu Thr Arg Glu Met He Lys His Leu Glu Arg Cys 
480 485 490 

GAA CAT CGA ATC ATG GAA TCC CTT GCA TGG CTC TCA GAT TCA CCT TTA 
Glu His Arg He Met Glu Ser Leu Ala Trp Leu Ser Asp Ser Pro Leu 
495 500 505 510 

TTT GAT CTT ATT AAA CAA TCA AAG GAC CGA GAA GGA CCA ACT GAT CAC 
Phe Asp Leu He Lys Gin Ser Lys Asp Arg Glu Gly Pro Thr Asp His 
515 520 525 

CTT GAA TCT GCT TGT CCT CTT AAT CTT CCT CTC CAG AAT AAT CAC ACT 
Leu Glu Ser Ala Cys Pro Leu Asn Leu Pro Leu Gin Asn Asn His Thr 
530 535 540 

GCA GCA GAT ATG TAT CTT TCT CCT GTA AGA TCT CCA AAG AAA AAA GGT 
Ala Ala Asp Met Tyr Leu Ser Pro Val Arg Ser Pro Lys Lys Lys Gly 
545 550 555 

TCA ACT ACG CGT GTA AAT TCT ACT GCA AAT GCA GAG ACA CAA GCA ACC 
Ser Thr Thr Arg Val Asn Ser Thr Ala Asn Ala Glu Thr Gin Ala Thr 
560 $65 57Q 



1152 



1200 



1248 



1296 



1344 



1392 



1440 



1488 



1536 



1584 



1632 



1680 



1728 



TCA GCC TTC CAG ACC CAG AAG CCA TTG AAA TCT ACC TCT CTT TCA CTG 1776 
Ser Ala Phe Gin Thr Gin Lys Pro Leu Lys Ser Thr Ser Leu Ser Leu 
575 580 585 590 

TTT TAT AAA AAA GTG TAT CGG CTA GCC TAT CTC CGG CTA AAT ACA CTT 1824 
Phe Tyr Lys Lys Val Tyr Arg Leu Ala Tyr Leu Arg Leu Asn Thr Leu 
595 600 605 

TGT GAA CGC CTT CTG TCT GAG CAC CCA GAA TTA GAA CAT ATC ATC TGG 1872 
Cys Glu Arg Leu Leu Ser Glu His Pro Glu Leu Glu His He He Trp 
610 615 620 

ACC CTT TTC CAG CAC ACC CTG CAG AAT GAG TAT GAA CTC ATG AGA GAC 1920 
Thr Leu Phe Gin His Thr Leu Gin Asn Glu Tyr Glu Leu Met Arg Asp 
625 630 635 

AGG CAT TTG GAC CAA ATT ATG ATG TGT TCC ATG TAT GGC ATA TGC AAA 1968 
Arg His Leu Asp Gin He Met Met Cys Ser Met Tyr Gly He Cys Lys 
640 645 650 

GTG AAG AAT ATA GAC CTT AAA TTC AAA ATC ATT GTA ACA GCA TAC AAG 2016 
Val Lys Asn He Asp Leu Lys Phe Lys He He Val Thr Ala Tyr Lys 
655 660 665 670 

GAT CTT CCT CAT GCT GTT CAG GAG ACA TTC AAA CGT GTT TTG ATC AAA 2 064 

Asp Leu Pro His Ala Val Gin Glu Thr Phe Lys Arg Val Leu He Lys 
675 680 685 

GAA GAG GAG TAT GAT TCT ATT ATA GTA TTC TAT AAC TCG GTC TTC ATG 2112 
Glu Glu Glu Tyr Asp Ser He He Val Phe Tyr Asn Ser Val Phe Met 
690 695 700 

CAG AGA CTG AAA ACA AAT ATT TTG CAG TAT GCT TCC ACC AGG CCC CCT 2160 
Gin Arg Leu Lys Thr Asn He Leu Gin Tyr Ala Ser Thr Arg Pro Pro 
705 710 715 

ACC TTG TCA CCA ATA CCT CAC ATT CCT CGA AGC CCT TAC AAG TTT CCT 22 08 

Thr Leu Ser Pro He Pro His He Pro Arg Ser Pro Tyr Lys Phe Pro 
720 725 730 

AGT TCA CCC TTA CGG ATT CCT GGA GGG AAC ATC TAT ATT TCA CCC CTG 2256 
Ser Ser Pro Leu Arg He Pro Gly Gly Asn He Tyr He Ser Pro Leu 
735 740 745 750 

AAG AGT CCA TAT AAA ATT TCA GAA GGT CTG CCA ACA CCA ACA AAA ATG 2304 
Lys Ser Pro Tyr Lys He Ser Glu Gly Leu Pro Thr Pro Thr Lys Met 
755 760 765 

ACT CCA AGA TCA AGA ATC TTA GTA TCA ATT GGT GAA TCA TTC GGG ACT 2352 
Thr Pro Arg Ser Arg He Leu Val Ser He Gly Glu Ser Phe Gly Thr 
770 775 780 
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AAG 
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AAT 
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ATG 
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TGT 
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nap 


CGT 


2400 
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Glu 


Lys 


Phe 


Gin 


Lys 


He 


Asn 
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ft 7i 7\ 
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Leu 


Lys 
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Glu 
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He 


Glu 
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His 
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CGA 
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ATG 
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AAG 
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ATG 


AAT 


GAT 
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855 
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TGAGGATCTC AGGACCTTGG TGGACACTGT 
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Ser 
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Glu 


Glu 
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865 

GTACACCTCT GGATTCATTG TCTCTCACAG ATGTGACTGT ATAACTTTCC CAGGTTCTGT 2 703 

TTATGGCCAC ATTTAATATC TTCAGCTCTT TTTGTGGATA TAAAATGTGC AGATGCAATT 2763 

GTTTGGGTGA TTCCTAAGCC ACTTGAAATG TTAGTCATTG TTATTTATAC AAGATTGAAA 2823 

ATCTTGTGTA AATCCTGCCA TTTAAAAAGT TGTAGCAGAT TGTTTCCTCT TCCAAAGTAA 2883 

AATTGCTGTG CTTTATGGAT AGTAAGAATG GCCCTAGAGT GGGAGTCCTG ATAACCCAGG 2943 

CCTGTCTGAC TACTTTGCCT TCTTTTGTAG CATATAGGTG ATGTTTGCTC TTGTTTTTAT 3003 

TAATTTATAT GTATATTTTT TTAATTTAAC ATGAACACCC TTAGAAAATG TGTCCTATCT 3063 

ATCTTCCAAA TGCAATTTGA TTGACTGCCC ATTCACCAAA ATTATCCTGA ACTCTTCTGC 3123 

AAAAATGGAT ATTATTAGAA ATTAGAAAAA AATTACTAAT TTTACACATT AGATTTTATT 3183 

TTACTATTGG AATCTGATAT ACTGTGTGCT TGTTTTATAA AATTTTGCTT TTAATTAAAT 3243 

AAAAGCTGGA AGCAAAGTAT AACCATATGA TACTATCATA CTACTGAAAC AGATTTCATA 33 03 

CCTCAGAATG TAAAAGAACT TACTGATTAT TTTCTTCATC CAACTTATGT TTTTAAATGA 3363 

GGATTATTGA TAGT 3377 



(2) INFORMATION FOR SEQ ID NO: 47: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 869 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:47: 

Met Pro Pro Lys Thr Pro Arg Lys Thr Ala Ala Thr Ala Ala Ala Ala 
1 5 10 15 

Ala Ala Glu Pro Pro Ala Pro Pro Pro Pro Pro Pro Pro Glu Glu Asp 
20 25 30 

Pro Glu Gin Asp Ser Gly Pro Glu Asp Leu Pro Leu Val Arg Leu Glu 
35 40 45 

Phe Glu Glu Thr Glu Glu Pro Asp Phe Thr Ala Leu Cys Gin Lys Leu 
50 55 60 

Lys He Pro Asp His Val Arg Glu Arg Ala Trp Leu Thr Trp Glu Lys 
65 70 75 80 

Val Ser Ser Val Asp Gly Val Leu Gly Gly Tyr He Gin Lys Lys Lys 
85 90 95 

Glu Leu Trp Gly He Cys He Phe He Ala Ala Val Asp Leu Asp Glu 
100 105 110 

Met Ser Phe Thr Phe Thr Glu Leu Gin Lys Asn He Glu He Ser Val 
H5 120 125 

His Lys Phe Phe Asn Leu Leu Lys Glu He Asp Thr Ser Thr Lys Val 
130 135 140 

Asp Asn Ala Met Ser Arg Leu Leu Lys Lys Tyr Asp Val Leu Phe Ala 
145 150 155 160 

Leu Phe Ser Lys Leu Glu Arg Thr Cys Glu Leu He Tyr Leu Thr Gin 
165 170 175 

Pro Ser Ser Ser Met Val Ala Val He Pro He Asn Gly Ser Pro Arg 
180 185 190 

Thr Pro Arg Arg Gly Gin Asn Arg Ser Ala Arg He Ala Lys Gin Leu 
195 200 205 

Glu Asn Asp Thr Arg He He Glu Val Leu Cys Lys Glu His Glu Cys 
210 215 220 



Asn lie Asp Glu Val Lys Asn Val Tyr Phe Lys Asn Phe lie Pro Phe 
225 230 235 240 



Met Asn Ser Leu Gly Leu Val Thr Ser Asn Gly Leu Pro Glu Val Glu 
245 250 255 

Asn Leu Ser Lys Arg Tyr Glu Glu He Tyr Leu Lys Asn Lys Asp Leu 
260 265 270 

Asp Ala Arg Leu Phe Leu Asp His Asp Lys Thr Leu Gin Thr Asp Ser 
275 280 285 

He Asp Ser Phe Glu Thr Gin Arg Thr Pro Arg Lys Ser Asn Leu Asp 
290 295 300 

Glu Glu Val Asn Val He Pro Pro His Thr Pro Val Arg Thr Val Met 
305 310 315 320 

Asn Thr He Gin Gin Leu Met Met He Leu Asn Ser Ala Ser Asp Gin 
325 330 335 

Pro Ser Glu Asn Leu He Ser Tyr Phe Asn Asn Cys Thr Val Asn Pro 
340 345 350 

Lys Glu Ser He Leu Lys Arg Val Lys Asp He Gly Tyr He Phe Lys 
355 360 365 

Glu Lys Phe Ala Lys Ala Val Gly Gin Gly Cys Val Glu He Gly Ser 
370 375 380 

Gin Arg Tyr Lys Leu Gly Val Arg Leu Tyr Tyr Arg Val Met Glu Ser 
38 5 390 395 400 

Met Leu Lys Ser Glu Glu Glu Arg Leu Ser He Gin Asn Phe Ser Lys 
405 410 415 

Leu Leu Asn Asp Asn He Phe His Met Ser Leu Leu Ala Cys Ala Leu 
420 425 430 

Glu Val Val Met Ala Thr Tyr Ser Arg Ser Thr Ser Gin Asn Leu Asp 
435 440 445 

Ser Gly Thr Asp Leu Ser Phe Pro Trp He Leu Asn Val Leu Asn Leu 
450 455 460 

Lys Ala Phe Asp Phe Tyr Lys Val He Glu Ser Phe He Lys Ala Glu 
465 470 475 480 

Gly Asn Leu Thr Arg Glu Met He Lys His Leu Glu Arg Cys Glu His 
485 490 495 



Arg He Met Glu Ser Leu Ala Trp Leu Ser Asp Ser Pro Leu Phe Asp 
500 505 510 



Leu He Lys Gin Ser Lys Asp Arg Glu Gly Pro Thr Asp His Leu Glu 
515 520 525 



Ser Ala Cys Pro Leu Asn Leu Pro Leu Gin Asn Asn His Thr Ala Ala 
530 535 540 

Asp Met Tyr Leu Ser Pro Val Arg Ser Pro Lys Lys Lys Gly Ser Thr 
545 550 555 560 

Thr Arg Val Asn Ser Thr Ala Asn Ala Glu Thr Gin Ala Thr Ser Ala 
565 570 575 

Phe Gin Thr Gin Lys Pro Leu Lys Ser Thr Ser Leu Ser Leu Phe Tyr 
580 585 590 

Lys Lys Val Tyr Arg Leu Ala Tyr Leu Arg Leu Asn Thr Leu Cys Glu 
5^5 600 605 

Arg Leu Leu Ser Glu His Pro Glu Leu Glu His He He Trp Thr Leu 
610 615 620 

Phe Gin His Thr Leu Gin Asn Glu Tyr Glu Leu Met Arg Asp Arg His 
625 630 635 640 

Leu Asp Gin He Met Met Cys Ser Met Tyr Gly He Cys Lys Val Lys 
^45 650 655 

Asn He Asp Leu Lys Phe Lys He He Val Thr Ala Tyr Lys Asp Leu 
660 665 670 

Pro His Ala Val Gin Glu Thr Phe Lys Arg Val Leu He Lys Glu Glu 
675 680 685 

Glu Tyr Asp Ser He He Val Phe Tyr Asn Ser Val Phe Met Gin Arg 
690 695 700 

Leu Lys Thr Asn He Leu Gin Tyr Ala Ser Thr Arg Pro Pro Thr Leu 
705 710 715 720 

Ser Pro He Pro His He Pro Arg Ser Pro Tyr Lys Phe Pro Ser Ser 
725 730 735 

Pro Leu Arg He Pro Gly Gly Asn He Tyr He Ser Pro Leu Lys Ser 
740 745 750 

Pro Tyr Lys He Ser Glu Gly Leu Pro Thr Pro Thr Lys Met Thr Pro 
755 760 765 

Arg Ser Arg He Leu Val Ser He Gly Glu Ser Phe Gly Thr Ser Glu 
770 775 780 

Lys Phe Gin Lys He Asn Gin Met Val Cys Asn Ser Asp Arg Val Leu 
785 790 795 800 



Lys Arg Ser Ala Glu Gly Ser Asn Pro Pro Lys Pro Leu Lys Lys Leu 
805 810 815 



Arg Phe Asp lie Glu Gly Ser Asp Glu Ala Asp Gly Ser Lys His Leu 
820 825 830 



Pro Gly Glu Ser Lys Phe Gin Gin Lys Leu Ala Glu Met Thr Ser Thr 
835 840 845 



Arg Thr Arg Met Gin Lys Gin Lys Met Asn Asp Ser Met Asp Thr Ser 
850 855 860 



Asn Lys Glu Glu Lys 
865 



(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3383 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 7.. 2619 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:48: 
CGCGTC ATG CCG CCC AAA ACC CCC CGA AAA ACG GCC GCC ACC GCC GCC 48 



Met Pro Pro Lys Thr Pro Arg Lys Thr Ala Ala Thr Ala Ala 
15 10 



GCT GCC GCC GCG GAA CCC CCG GCA CCG CCG CCG CCG CCC CCT CCT GAG 
Ala Ala Ala Ala Glu Pro Pro Ala Pro Pro Pro Pro Pro Pro Pro Glu 
15 20 25 30 



96 



GAG GAC CCA GAG CAG GAC AGC GGC CCG GAG GAC CTG CCT CTC GTC AGG 
Glu Asp Pro Glu Gin Asp Ser Gly Pro Glu Asp Leu Pro Leu Val Arg 
35 40 45 



144 



CTT GAG TTT GAA GAA ACA GAA GAA CCT GAT TTT ACT GCA TTA TGT CAG 
Leu Glu Phe Glu Glu Thr Glu Glu Pro Asp Phe Thr Ala Leu Cys Gin 
50 55 60 



192 



AAA TTA AAG ATA CCA GAT CAT GTC AGA GAG AGA GCT TGG TTA ACT TGG 
Lys Leu Lys lie Pro Asp His Val Arg Glu Arg Ala Trp Leu Thr Trp 
65 70 75 



240 



GAG AAA GTT TCA TCT GTG GAT GGA GTA TTG GGA GGT TAT ATT CAA AAG 288 
Glu Lys Val Ser Ser Val Asp Gly Val Leu Gly Gly Tyr lie Gin Lys 
80 85 90 



AAA AAG GAA CTG TGG GGA ATC TGT ATC TTT ATT GCA GCA GTT GAC CTA 336 
Lys Lys Glu Leu Trp Gly lie Cys lie Phe lie Ala Ala Val Asp Leu 
95 100 105 110 

GAT GAG ATG TCG TTC ACT TTT ACT GAG CTA CAG AAA AAC ATA GAA ATC 384 
Asp Glu Met Ser Phe Thr Phe Thr Glu Leu Gin Lys Asn lie Glu lie 
115 120 125 

AGT GTC CAT AAA TTC TTT AAC TTA CTA AAA GAA ATT GAT ACC AGT ACC 432 
Ser Val His Lys Phe Phe Asn Leu Leu Lys Glu lie Asp Thr Ser Thr 
130 135 140 

AAA GTT GAT AAT GCT ATG TCA AGA CTG TTG AAG AAG TAT GAT GTA TTG 48 0 

Lys Val Asp Asn Ala Met Ser Arg Leu Leu Lys Lys Tyr Asp Val Leu 
145 150 155 

TTT GCA CTC TTC AGC AAA TTG GAA AGG ACA TGT GAA CTT ATA TAT TTG 528 
Phe Ala Leu Phe Ser Lys Leu Glu Arg Thr Cys Glu Leu lie Tyr Leu 
160 165 170 

ACA CAA CCC AGC AGT TCG ATA TCT ACT GAA ATA AAT TCT GCA TTG GTG 576 
Thr Gin Pro Ser Ser Ser lie Ser Thr Glu lie Asn Ser Ala Leu Val 
175 180 185 190 

CTA AAA GTT TCT TGG ATC ACA TTT TTA TTA GCT AAA GGG GAA GTA TTA 624 
Leu Lys Val Ser Trp lie Thr Phe Leu Leu Ala Lys Gly Glu Val Leu 
195 200 205 

CAA ATG GAA GAT GAT CTG GTG ATT TCA TTT CAG TTA ATG CTA TGT GTC 6 72 

Gin Met Glu Asp Asp Leu Val lie Ser Phe Gin Leu Met Leu Cys Val 
210 215 220 

CTT GAC TAT TTT ATT AAA CTC TCA CCT CCC ATG TTG CTC AAA GAA CCA 720 
Leu Asp Tyr Phe lie Lys Leu Ser Pro Pro Met Leu Leu Lys Glu Pro 
225 230 235 

TAT AAA ACA GGG TCG AAT TCT CTT GGA CTT GTA ACA TCT AAT GGA CTT 768 
Tyr Lys Thr Gly Ser Asn Ser Leu Gly Leu Val Thr Ser Asn Gly Leu 
240 245 250 

CCA GAG GTT GAA AAT CTT TCT AAA CGA TAC GAA GAA ATT TAT CTT AAA 816 
Pro Glu Val Glu Asn Leu Ser Lys Arg Tyr Glu Glu lie Tyr Leu Lys 
255 260 265 270 

AAT AAA GAT CTA GAT GCA AGA TTA TTT TTG GAT CAT GAT AAA ACT CTT 864 
Asn Lys Asp Leu Asp Ala Arg Leu Phe Leu Asp His Asp Lys Thr Leu 
275 280 285 

CAG ACT GAT TCT ATA GAC AGT TTT GAA ACA CAG AGA ACA CCA CGA AAA 912 
Gin Thr Asp Ser lie Asp Ser Phe Glu Thr Gin Arg Thr Pro Arg Lys 
290 295 300 



AGT AAC CTT GAT GAA GAG GTG AAT GTA ATT CCT CCA CAC ACT CCA GTT 960 
Ser Asn Leu Asp Glu Glu Val Asn Val lie Pro Pro His Thr Pro Val 
305 310 315 

AGG ACT GTT ATG AAC ACT ATC CAA CAA TTA ATG ATG ATT TTA AAT TCA 1008 
Arg Thr Val Met Asn Thr lie Gin Gin Leu Met Met lie Leu Asn Ser 
320 325 330 

GCA AGT GAT CAA CCT TCA GAA AAT CTG ATT TCC TAT TTT AAC AAC TGC 1056 
Ala Ser Asp Gin Pro Ser Glu Asn Leu lie Ser Tyr Phe Asn Asn Cys 
335 340 345 350 

ACA GTG AAT CCA AAA GAA AGT ATA CTG AAA AGA GTG AAG GAT ATA GGA 1104 
Thr Val Asn Pro Lys Glu Ser lie Leu Lys Arg Val Lys Asp lie Gly 
355 360 365 

TAC ATC TTT AAA GAG AAA TTT GCT AAA GCT GTG GGA CAG GGT TGT GTC 1152 
Tyr He Phe Lys Glu Lys Phe Ala Lys Ala Val Gly Gin Gly Cys Val 
370 375 380 

GAA ATT GGA TCA CAG CGA TAC AAA CTT GGA GTT CGC TTG TAT TAC CGA 1200 
Glu He Gly Ser Gin Arg Tyr Lys Leu Gly Val Arg Leu Tyr Tyr Arg 
385 390 395 

GTA ATG GAA TCC ATG CTT AAA TCA GAA GAA GAA CGA TTA TCC ATT CAA 1248 
Val Met Glu Ser Met Leu Lys Ser Glu Glu Glu Arg Leu Ser He Gin 
400 405 410 

AAT TTT AGC AAA CTT CTG AAT GAC AAC ATT TTT CAT ATG TCT TTA TTG 1296 
Asn Phe Ser Lys Leu Leu Asn Asp Asn He Phe His Met Ser Leu Leu 
415 420 425 430 

GCG TGC GCT CTT GAG GTT GTA ATG GCC ACA TAT AGC AGA AGT ACA TCT 1344 
Ala Cys Ala Leu Glu Val Val Met Ala Thr Tyr Ser Arg Ser Thr Ser 
435 440 445 

CAG AAT CTT GAT TCT GGA ACA GAT TTG TCT TTC CCA TGG ATT CTG AAT 13 92 

Gin Asn Leu Asp Ser Gly Thr Asp Leu Ser Phe Pro Trp He Leu Asn 
450 455 460 

GTG CTT AAT TTA AAA GCC TTT GAT TTT TAC AAA GTG ATC GAA AGT TTT 1440 
Val Leu Asn Leu Lys Ala Phe Asp Phe Tyr Lys Val He Glu Ser Phe 
465 470 475 

ATC AAA GCA GAA GGC AAC TTG ACA AGA GAA ATG ATA AAA CAT TTA GAA 1488 
He Lys Ala Glu Gly Asn Leu Thr Arg Glu Met He Lys His Leu Glu 
480 485 490 

CGA TGT GAA CAT CGA ATC ATG GAA TCC CTT GCA TGG CTC TCA GAT TCA 1536 
Arg Cys Glu His Arg He Met Glu Ser Leu Ala Trp Leu Ser Asp Ser 
495 500 505 510 



CCT TTA TTT GAT CTT ATT AAA CAA TCA AAG GAC CGA GAA GGA CCA ACT 1584 
Pro Leu Phe Asp Leu lie Lys Gin Ser Lys Asp Arg Glu Gly Pro Thr 
515 520 525 

GAT CAC CTT GAA TCT GCT TGT CCT CTT AAT CTT CCT CTC CAG AAT AAT 1632 
Asp His Leu Glu Ser Ala Cys Pro Leu Asn Leu Pro Leu Gin Asn Asn 
530 535 540 

CAC ACT GCA GCA GAT ATG TAT CTT TCT CCT GTA AGA TCT CCA AAG AAA 168 0 

His Thr Ala Ala Asp Met Tyr Leu Ser Pro Val Arg Ser Pro Lys Lys 
545 550 555 

AAA GGT TCA ACT ACG CGT GTA AAT TCT ACT GCA AAT GCA GAG ACA CAA 1728 
Lys Gly Ser Thr Thr Arg Val Asn Ser Thr Ala Asn Ala Glu Thr Gin 
560 565 570 

GCA ACC TCA GCC TTC CAG ACC CAG AAG CCA TTG AAA TCT ACC TCT CTT 1776 
Ala Thr Ser Ala Phe Gin Thr Gin Lys Pro Leu Lys Ser Thr Ser Leu 
575 580 585 590 

TCA CTG TTT TAT AAA AAA GTG TAT CGG CTA GCC TAT CTC CGG CTA AAT 1824 
Ser Leu Phe Tyr Lys Lys Val Tyr Arg Leu Ala Tyr Leu Arg Leu Asn 
595 600 605 

ACA CTT TGT GAA CGC CTT CTG TCT GAG CAC CCA GAA TTA GAA CAT ATC 1872 
Thr Leu Cys Glu Arg Leu Leu Ser Glu His Pro Glu Leu Glu' His lie 
610 615 620 

ATC TGG ACC CTT TTC CAG CAC ACC CTG CAG AAT GAG TAT GAA CTC ATG 192 0 

lie Trp Thr Leu Phe Gin His Thr Leu Gin Asn Glu Tyr Glu Leu Met 
625 630 635 

AGA GAC AGG CAT TTG GAC CAA ATT ATG ATG TGT TCC ATG TAT GGC ATA 1968 
Arg Asp Arg His Leu Asp Gin lie Met Met Cys Ser Met Tyr Gly lie 
640 645 650 

TGC AAA GTG AAG AAT ATA GAC CTT AAA TTC AAA ATC ATT GTA ACA GCA 2016 
Cys Lys Val Lys Asn lie Asp Leu Lys Phe Lys He He Val Thr Ala 
655 660 665 670 

TAC AAG GAT CTT CCT CAT GCT GTT CAG GAG ACA TTC AAA CGT GTT TTG 2 064 

Tyr Lys Asp Leu Pro His Ala Val Gin Glu Thr Phe Lys Arg Val Leu 
675 680 685 

ATC AAA GAA GAG GAG TAT GAT TCT ATT ATA GTA TTC TAT AAC TCG GTC 2112 
He Lys Glu Glu Glu Tyr Asp Ser He He Val Phe Tyr Asn Ser Val 
690 695 700 

TTC ATG CAG AGA CTG AAA ACA AAT ATT TTG CAG TAT GCT TCC ACC AGG 2160 
Phe Met Gin Arg Leu Lys Thr Asn He Leu Gin Tyr Ala Ser Thr Arg 
705 710 715 



CCC CCT ACC TTG TCA CCA ATA CCT CAC ATT CCT CGA AGC CCT TAC AAG 2208 
Pro Pro Thr Leu Ser Pro lie Pro His lie Pro Arg Ser Pro Tyr Lys 
720 725 730 

TTT CCT AGT TCA CCC TTA CGG ATT CCT GGA GGG AAC ATC TAT ATT TCA 2256 
Phe Pro Ser Ser Pro Leu Arg lie Pro Gly Gly Asn lie Tyr lie Ser 
735 740 745 750 

CCC CTG AAG AGT CCA TAT AAA ATT TCA GAA GGT CTG CCA ACA CCA ACA 2304 
Pro Leu Lys Ser Pro Tyr Lys lie Ser Glu Gly Leu Pro Thr Pro Thr 
755 760 765 

AAA ATG ACT CCA AGA TCA AGA ATC TTA GTA TCA ATT GGT GAA TCA TTC 2352 
Lys Met Thr Pro Arg Ser Arg lie Leu Val Ser He Gly Glu Ser Phe 
770 775 780 

GGG ACT TCT GAG AAG TTC CAG AAA ATA AAT CAG ATG GTA TGT AAC AGC 2400 
Gly Thr Ser Glu Lys Phe Gin Lys He Asn Gin Met Val Cys Asn Ser 
785 790 795 

GAC CGT GTG CTC AAA AGA AGT GCT GAA GGA AGC AAC CCT CCT AAA CCA 2448 
Asp Arg Val Leu Lys Arg Ser Ala Glu Gly Ser Asn Pro Pro Lys Pro 
800 805 810 

CTG AAA AAA CTA CGC TTT GAT ATT GAA GGA TCA GAT GAA GCA GAT GGA 2496 
Leu Lys Lys Leu Arg Phe Asp He Glu Gly Ser Asp Glu Ala Asp Gly 
815 820 825 830 

AGT AAA CAT CTC CCA GGA GAG TCC AAA TTT CAG CAG AAA CTG GCA GAA 2544 
Ser Lys His Leu Pro Gly Glu Ser Lys Phe Gin Gin Lys Leu Ala Glu 
835 840 845 

ATG ACT TCT ACT CGA ACA CGA ATG CAA AAG CAG AAA ATG AAT GAT AGC 2592 
Met Thr Ser Thr Arg Thr Arg Met Gin Lys Gin Lys Met Asn Asp Ser 
850 855 860 

ATG GAT ACC TCA AAC AAG GAA GAG AAA TGAGGATCTC AGGACCTTGG 2639 
Met Asp Thr Ser Asn Lys Glu Glu Lys 
865 870 

TGGACACTGT GTACACCTCT GGATTCATTG TCTCTCACAG ATGTGACTGT ATAACTTTCC 2699 

CAGGTTCTGT TTATGGCCAC ATTTAATATC TTCAGCTCTT TTTGTGGATA TAAAATGTGC 2759 

AGATGCAATT GTTTGGGTGA TTCCTAAGCC ACTTGAAATG TTAGTCATTG TTATTTATAC 2819 

AAGATTGAAA ATCTTGTGTA AATCCTGCCA TTTAAAAAGT TGTAGCAGAT TGTTTCCTCT 28 79 

TCCAAAGTAA AATTGCTGTG CTTTATGGAT AGTAAGAATG GCCCTAGAGT GGGAGTCCTG 2939 

ATAACCCAGG CCTGTCTGAC TACTTTGCCT TCTTTTGTAG CATATAGGTG ATGTTTGCTC 2999 

TTGTTTTTAT TAATTTATAT GTATATTTTT TTAATTTAAC ATGAACACCC TTAGAAAATG 3059 



TGTCCTATCT ATCTTCCAAA TGCAATTTGA TTGACTGCCC ATTCACCAAA ATTATCCTGA 3119 

ACTCTTCTGC AAAAATGGAT ATTATTAGAA ATTAGAAAAA AATTACTAAT TTTACACATT 3179 

AGATTTTATT TTACTATTGG AATCTGATAT ACTGTGTGCT TGTTTTATAA AATTTTGCTT 323 9 

TTAATTAAAT AAAAGCTGGA AGCAAAGTAT AAC CATATGA TACTATCATA CTACTGAAAC 3299 

AGATTTCATA CCTCAGAATG TAAAAGAACT TACTGATTAT TTTCTTCATC CAACTTATGT 3359 

TTTTAAATGA GGATTATTGA TAGT 3383 



(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 871 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

Met Pro Pro Lys Thr Pro Arg Lys Thr Ala Ala Thr Ala Ala Ala Ala 
15 10 15 

Ala Ala Glu Pro Pro Ala Pro Pro Pro Pro Pro Pro Pro Glu Glu Asp 
20 25 30 

Pro Glu Gin Asp Ser Gly Pro Glu Asp Leu Pro Leu Val Arg Leu Glu 
35 40 45 

Phe Glu Glu Thr Glu Glu Pro Asp Phe Thr Ala Leu Cys Gin Lys Leu 
50 55 60 

Lys lie Pro Asp His Val Arg Glu Arg Ala Trp Leu Thr Trp Glu Lys 
65 70 75 80 

Val Ser Ser Val Asp Gly Val Leu Gly Gly Tyr lie Gin Lys Lys Lys 
85 90 95 

Glu Leu Trp Gly He Cys He Phe He Ala Ala Val Asp Leu Asp Glu 
100 105 110 

Met Ser Phe Thr Phe Thr Glu Leu Gin Lys Asn He Glu He Ser Val 
115 120 125 

His Lys Phe Phe Asn Leu Leu Lys Glu He Asp Thr Ser Thr Lys Val 
130 135 140 



Asp Asn 
145 



Ala Met Ser Arg Leu Leu Lys 
150 



Lys Tyr Asp Val Leu Phe Ala 
155 160 



Leu Phe Ser Lys Leu Glu Arg Thr Cys Glu Leu lie Tyr Leu Thr Gin 
165 170 175 

Pro Ser Ser Ser lie Ser Thr Glu lie Asn Ser Ala Leu Val Leu Lys 
180 185 190 

Val Ser Trp lie Thr Phe Leu Leu Ala Lys Gly Glu Val Leu Gin Met 
195 200 205 

Glu Asp Asp Leu Val lie Ser Phe Gin Leu Met Leu Cys Val Leu Asp 
210 215 220 

Tyr Phe lie Lys Leu Ser Pro Pro Met Leu Leu Lys Glu Pro Tyr Lys 
225 230 235 240 

Thr Gly Ser Asn Ser Leu Gly Leu Val Thr Ser Asn Gly Leu Pro Glu 
245 250 255 

Val Glu Asn Leu Ser Lys Arg Tyr Glu Glu lie Tyr Leu Lys Asn Lys 
260 265 270 

Asp Leu Asp Ala Arg Leu Phe Leu Asp His Asp Lys Thr Leu Gin Thr 
275 280 285 

Asp Ser lie Asp Ser Phe Glu Thr Gin Arg Thr Pro Arg Lys Ser Asn 
290 295 300 

Leu Asp Glu Glu Val Asn Val lie Pro Pro His Thr Pro Val Arg Thr 
305 310 315 320 

Val Met Asn Thr lie Gin Gin Leu Met Met lie Leu Asn Ser Ala Ser 
325 330 335 

Asp Gin Pro Ser Glu Asn Leu lie Ser Tyr Phe Asn Asn Cys Thr Val 
340 345 350 

Asn Pro Lys Glu Ser lie Leu Lys Arg Val Lys Asp lie Gly Tyr lie 
355 360 365 

Phe Lys Glu Lys Phe Ala Lys Ala Val Gly Gin Gly Cys Val Glu lie 
370 375 380 

Gly Ser Gin Arg Tyr Lys Leu Gly Val Arg Leu Tyr Tyr Arg Val Met 
385 390 395 400 

Glu Ser Met Leu Lys Ser Glu Glu Glu Arg Leu Ser lie Gin Asn Phe 
405 410 415 

Ser Lys Leu Leu Asn Asp Asn lie Phe His Met Ser Leu Leu Ala Cys 
420 425 430 

Ala Leu Glu Val Val Met Ala Thr Tyr Ser Arg Ser Thr Ser Gin Asn 
435 440 445 



Leu Asp Ser Gly Thr Asp Leu Ser Phe Pro Trp lie Leu Asn Val Leu 
450 455 460 



Asn Leu Lys Ala Phe Asp Phe Tyr Lys Val lie Glu Ser Phe lie Lys 
465 470 475 480 

Ala Glu Gly Asn Leu Thr Arg Glu Met lie Lys His Leu Glu Arg Cys 
485 490 495 

Glu His Arg lie Met Glu Ser Leu Ala Trp Leu Ser Asp Ser Pro Leu 
500 505 510 

Phe Asp Leu lie Lys Gin Ser Lys Asp Arg Glu Gly Pro Thr Asp His 
515 520 525 

Leu Glu Ser Ala Cys Pro Leu Asn Leu Pro Leu Gin Asn Asn His Thr 
530 535 540 

Ala Ala Asp Met Tyr Leu Ser Pro Val Arg Ser Pro Lys Lys Lys Gly 
545 550 555 560 

Ser Thr Thr Arg Val Asn Ser Thr Ala Asn Ala Glu Thr Gin Ala Thr 
565 570 575 

Ser Ala Phe Gin Thr Gin Lys Pro Leu Lys Ser Thr Ser Leu Ser Leu 
580 585 590 

Phe Tyr Lys Lys Val Tyr Arg Leu Ala Tyr Leu Arg Leu Asn Thr Leu 
595 600 605 

Cys Glu Arg Leu Leu Ser Glu His Pro Glu Leu Glu His lie lie Trp 
610 615 620 

Thr Leu Phe Gin His Thr Leu Gin Asn Glu Tyr Glu Leu Met Arg Asp 
625 630 635 640 

Arg His Leu Asp Gin lie Met Met Cys Ser Met Tyr Gly lie Cys Lys 
645 650 655 

Val Lys Asn lie Asp Leu Lys Phe Lys lie lie Val Thr Ala Tyr Lys 
660 665 670 

Asp Leu Pro His Ala Val Gin Glu Thr Phe Lys Arg Val Leu lie Lys 
675 680 685 

Glu Glu Glu Tyr Asp Ser lie lie Val Phe Tyr Asn Ser Val Phe Met 
690 695 700 

Gin Arg Leu Lys Thr Asn He Leu Gin Tyr Ala Ser Thr Arg Pro Pro 
705 710 715 720 



Thr Leu Ser Pro He Pro His He Pro Arg Ser Pro Tyr Lys Phe Pro 
725 730 735 



Ser Ser Pro Leu Arg 
740 

Lys Ser Pro Tyr Lys 
755 

Thr Pro Arg Ser Arg 
770 

Ser Glu Lys Phe Gin 
785 

Val Leu Lys Arg Ser 
805 

Lys Leu Arg Phe Asp 
820 

His Leu Pro Gly Glu 
835 

Ser Thr Arg Thr Arg 
850 

Thr Ser Asn Lys Glu 
865 



lie Pro Gly Gly Asn lie 
745 

lie Ser Glu Gly Leu Pro 
760 

He Leu Val Ser He Gly 
775 

Lys lie Asn Gin Met Val 
790 795 

Ala Glu Gly Ser Asn Pro 
810 

He Glu Gly Ser Asp Glu 
825 

Ser Lys Phe Gin Gin Lys 
840 

Met Gin Lys Gin Lys Met 
855 

Glu Lys 
870 



Tyr He Ser Pro Leu 
750 

Thr Pro Thr Lys Met 
765 

Glu Ser Phe Gly Thr 
780 

Cys Asn Ser Asp Arg 
800 

Pro Lys Pro Leu Lys 
815 

Ala Asp Gly Ser Lys 
830 

Leu Ala Glu Met Thr 
845 

Asn Asp Ser Met Asp 
860 



(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3554 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ix) FEATURE : 

(A) NAME /KEY : CDS 

(B) LOCATION: 7. .2790 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

CGCGTC ATG CCG CCC AAA ACC CCC CGA AAA ACG GCC GCC ACC GCC GCC 48 
Met Pro Pro Lys Thr Pro Arg Lys Thr Ala Ala Thr Ala Ala 
15 10 

GCT GCC GCC GCG GAA CCC CCG GCA CCG CCG CCG CCG CCC CCT CCT GAG 96 
Ala Ala Ala Ala Glu Pro Pro Ala Pro Pro Pro Pro Pro Pro Pro Glu 
15 20 25 30 



GAG GAC CCA GAG CAG GAC AGC GGC CCG GAG GAC CTG CCT CTC GTC AGG 
Glu Asp Pro Glu Gin Asp Ser Gly Pro Glu Asp Leu Pro Leu Val Arg 
35 40 45 



144 



CTT GAG TTT GAA GAA AC A GAA GAA CCT GAT TTT ACT GCA TTA TGT CAG 192 
Leu Glu Phe Glu Glu Thr Glu Glu Pro Asp Phe Thr Ala Leu Cys Gin 
50 55 60 

AAA TTA AAG ATA CCA GAT CAT GTC AGA GAG AGA GCT TGG TTA ACT TGG 240 
Lys Leu Lys lie Pro Asp His Val Arg Glu Arg Ala Trp Leu Thr Trp 
65 70 75 

GAG AAA GTT TCA TCT GTG GAT GGA GTA TTG GGA GGT TAT ATT CAA AAG 2 88 

Glu Lys Val Ser Ser Val Asp Gly Val Leu Gly Gly Tyr He Gin Lys 
80 85 90 

AAA AAG GAA CTG TGG GGA ATC TGT ATC TTT ATT GCA GCA GTT GAC CTA 336 
Lys Lys Glu Leu Trp Gly He Cys He Phe He Ala Ala Val Asp Leu 
95 100 105 110 

GGT GAT ATG TCG TTC ACT TTT ACT GAG CTA CAG AAA AAC ATA GAA ATC 384 
Gly Asp Met Ser Phe Thr Phe Thr Glu Leu Gin Lys Asn He Glu He 
115 120 125 

AGT GTC CAT AAA TTC TTT AAC TTA CTA AAA GAA ATT GAT ACC AGT ACC 432 
Ser Val His Lys Phe Phe Asn Leu Leu Lys Glu He Asp Thr Ser Thr 
130 135 140 

AAA GTT GAT AAT GCT ATG TCA AGA CTG TTG AAG AAG TAT GAT GTA TTG 4 80 

Lys Val Asp Asn Ala Met Ser Arg Leu Leu Lys Lys Tyr Asp Val Leu 
145 150 155 

TTT GCA CTC TTC AGC AAA TTG GAA AGG ACA TGT GAA CTT ATA TAT TTG 528 
Phe Ala Leu Phe Ser Lys Leu Glu Arg Thr Cys Glu Leu He Tyr Leu 
160 165 170 

ACA CAA CCC AGC AGT TCG ATA TCT ACT GAA ATA AAT TCT GCA TTG GTG 576 
Thr Gin Pro Ser Ser Ser He Ser Thr Glu He Asn Ser Ala Leu Val 
175 180 185 190 

CTA AAA GTT TCT TGG ATC ACA TTT TTA TTA GCT AAA GGG GAA GTA TTA 624 
Leu Lys Val Ser Trp He Thr Phe Leu Leu Ala Lys Gly Glu Val Leu 
195 200 205 

CAA ATG GAA GAT GAT CTG GTG ATT TCA TTT CAG TTA ATG CTA TGT GTC 672 
Gin Met Glu Asp Asp Leu Val He Ser Phe Gin Leu Met Leu Cys Val 
210 215 220 

CTT GAC TAT TTT ATT AAA CTC TCA CCT CCC ATG TTG CTC AAA GAA CCA 72 0 

Leu Asp Tyr Phe He Lys Leu Ser Pro Pro Met Leu Leu Lys Glu Pro 
225 230 235 

TAT AAA ACA GCT GTT ATA CCC ATT AAT GGT TCA CCT CGA ACA CCC AGG 768 
Tyr Lys Thr Ala Val He Pro He Asn Gly Ser Pro Arg Thr Pro Arg 
240 245 250 



CGA GGT CAG AAC AGG AGT GCA CGG ATA GCA AAA CAA CTA GAA AAT GAT 816 
Arg Gly Gin Asn Arg Ser Ala Arg lie Ala Lys Gin Leu Glu Asn Asp 
255 260 265 270 

ACA AGA ATT ATT GAA GTT CTC TGT AAA GAA CAT GAA TGT AAT ATA GAT 864 
Thr Arg lie lie Glu Val Leu Cys Lys Glu His Glu Cys Asn lie Asp 
275 280 285 

GAG GTG AAA AAT GTT TAT TTC AAA AAT TTT ATA CCT TTT ATG AAT TCT 912 
Glu Val Lys Asn Val Tyr Phe Lys Asn Phe lie Pro Phe Met Asn Ser 
290 295 300 

CTT GGA CTT GTA ACA TCT AAT GGA CTT CCA GAG GTT GAA AAT CTT TCT 960 
Leu Gly Leu Val Thr Ser Asn Gly Leu Pro Glu Val Glu Asn Leu Ser 
305 310 315 

AAA CGA TAC GAA GAA ATT TAT CTT AAA AAT AAA GAT CTA GAT GCA AGA 1008 
Lys Arg Tyr Glu Glu lie Tyr Leu Lys Asn Lys Asp Leu Asp Ala Arg 
320 325 330 

TTA TTT TTG GAT CAT GAT AAA ACT CTT CAG ACT GAT TCT ATA GAC AGT 1056 
Leu Phe Leu Asp His Asp Lys Thr Leu Gin Thr Asp Ser lie Asp Ser 
335 340 345 350 

TTT GAA ACA CAG AGA ACA CCA CGA AAA AGT AAC CTT GAT GAA GAG GTG 1104 
Phe Glu Thr Gin Arg Thr Pro Arg Lys Ser Asn Leu Asp Glu Glu Val 
355 360 365 

AAT GTA ATT CCT CCA CAC ACT CCA GTT AGG ACT GTT ATG AAC ACT ATC 1152 
Asn Val lie Pro Pro His Thr Pro Val Arg Thr Val Met Asn Thr He 
370 375 380 

CAA CAA TTA ATG ATG ATT TTA AAT TCA GCA AGT GAT CAA CCT TCA GAA 1200 
Gin Gin Leu Met Met He Leu Asn Ser Ala Ser Asp Gin Pro Ser Glu 
385 390 395 

AAT CTG ATT TCC TAT TTT AAC AAC TGC ACA GTG AAT CCA AAA GAA AGT 1248 
Asn Leu He Ser Tyr Phe Asn Asn Cys Thr Val Asn Pro Lys Glu Ser 
400 405 410 

ATA CTG AAA AGA GTG AAG GAT ATA GGA TAC ATC TTT AAA GAG AAA TTT 12 96 

He Leu Lys Arg Val Lys Asp He Gly Tyr He Phe Lys Glu Lys Phe 
415 420 425 430 

GCT AAA GCT GTG GGA CAG GGT TGT GTC GAA ATT GGA TCA CAG CGA TAC 1344 
Ala Lys Ala Val Gly Gin Gly Cys Val Glu He Gly Ser Gin Arg Tyr 
435 440 445 

AAA CTT GGA GTT CGC TTG TAT TAC CGA GTA ATG GAA TCC ATG CTT AAA 13 92 

Lys Leu Gly Val Arg Leu Tyr Tyr Arg Val Met Glu Ser Met Leu Lys 
450 455 460 



TCA GAA GAA GAA CGA TTA TCC ATT CAA AAT TTT AGC AAA CTT CTG AAT 144 0 

Ser Glu Glu Glu Arg Leu Ser lie Gin Asn Phe Ser Lys Leu Leu Asn 
465 470 475 

GAC AAC ATT TTT CAT ATG TCT TTA TTG GCG TGC GCT CTT GAG GTT GTA 1488 
Asp Asn lie Phe His Met Ser Leu Leu Ala Cys Ala Leu Glu Val Val 
480 485 490 

ATG GCC ACA TAT AGC AGA AGT ACA TCT CAG AAT CTT GAT TCT GGA ACA 1536 
Met Ala Thr Tyr Ser Arg Ser Thr Ser Gin Asn Leu Asp Ser Gly Thr 
495 500 505 510 

GAT TTG TCT TTC CCA TGG ATT CTG AAT GTG CTT AAT TTA AAA GCC TTT 1584 
Asp Leu Ser Phe Pro Trp lie Leu Asn Val Leu Asn Leu Lys Ala Phe 
515 520 525 

GAT TTT TAC AAA GTG ATC GAA AGT TTT ATC AAA GCA GAA GGC AAC TTG 1632 
Asp Phe Tyr Lys Val lie Glu Ser Phe He Lys Ala Glu Gly Asn Leu 
530 535 540 

ACA AGA GAA ATG ATA AAA CAT TTA GAA CGA TGT GAA CAT CGA ATC ATG 1680 
Thr Arg Glu Met He Lys His Leu Glu Arg Cys Glu His Arg He Met 
545 550 555 

GAA TCC CTT GCA TGG CTC TCA GAT TCA CCT TTA TTT GAT CTT ATT AAA 1728 
Glu Ser Leu Ala Trp Leu Ser Asp Ser Pro Leu Phe Asp Leu He Lys 
560 565 570 

CAA TCA AAG GAC CGA GAA GGA CCA ACT GAT CAC CTT GAA TCT GCT TGT 1776 
Gin Ser Lys Asp Arg Glu Gly Pro Thr Asp His Leu Glu Ser Ala Cys 
575 580 585 590 

CCT CTT AAT CTT CCT CTC CAG AAT AAT CAC ACT GCA GCA GAT ATG TAT 1824 
Pro Leu Asn Leu Pro Leu Gin Asn Asn His Thr Ala Ala Asp Met Tyr 
595 600 605 

CTT TCT CCT GTA AGA TCT CCA AAG AAA AAA GGT TCA ACT ACG CGT GTA 1872 
Leu Ser Pro Val Arg Ser Pro Lys Lys Lys Gly Ser Thr Thr Arg Val 
610 615 620 

AAT TCT ACT GCA AAT GCA GAG ACA CAA GCA ACC TCA GCC TTC CAG ACC 1920 
Asn Ser Thr Ala Asn Ala Glu Thr Gin Ala Thr Ser Ala Phe Gin Thr 
625 630 635 

CAG AAG CCA TTG AAA TCT ACC TCT CTT TCA CTG TTT TAT AAA AAA GTG 1968 
Gin Lys Pro Leu Lys Ser Thr Ser Leu Ser Leu Phe Tyr Lys Lys Val 
640 645 650 

TAT CGG CTA GCC TAT CTC CGG CTA AAT ACA CTT TGT GAA CGC CTT CTG 2016 
Tyr Arg Leu Ala Tyr Leu Arg Leu Asn Thr Leu Cys Glu Arg Leu Leu 
655 660 665 670 



TCT GAG CAC CCA GAA TTA GAA CAT ATC ATC TGG ACC CTT TTC CAG CAC 2 064 

Ser Glu His Pro Glu Leu Glu His lie lie Trp Thr Leu Phe Gin His 
675 680 685 

ACC CTG CAG AAT GAG TAT GAA CTC ATG AGA GAC AGG CAT TTG GAC CAA 2112 
Thr Leu Gin Asn Glu Tyr Glu Leu Met Arg Asp Arg His Leu Asp Gin 
690 695 700 

ATT ATG ATG TGT TCC ATG TAT GGC ATA TGC AAA GTG AAG AAT ATA GAC 216 0 

lie Met Met Cys Ser Met Tyr Gly lie Cys Lys Val Lys Asn lie Asp 
705 710 715 

CTT AAA TTC AAA ATC ATT GTA ACA GCA TAC AAG GAT CTT CCT CAT GCT 2208 
Leu Lys Phe Lys lie lie Val Thr Ala Tyr Lys Asp Leu Pro His Ala 
720 725 730 

GTT CAG GAG ACA TTC AAA CGT GTT TTG ATC AAA GAA GAG GAG TAT GAT 22 56 

Val Gin Glu Thr Phe Lys Arg Val Leu lie Lys Glu Glu Glu Tyr Asp 
735 740 745 750 

TCT ATT ATA GTA TTC TAT AAC TCG GTC TTC ATG CAG AGA CTG AAA ACA 23 04 

Ser lie lie Val Phe Tyr Asn Ser Val Phe Met Gin Arg Leu Lys Thr 
755 760 765 

AAT ATT TTG CAG TAT GCT TCC ACC AGG CCC CCT ACC TTG TCA CCA ATA 2352 
Asn lie Leu Gin Tyr Ala Ser Thr Arg Pro Pro Thr Leu Ser Pro lie 
770 775 780 

CCT CAC ATT CCT CGA AGC CCT TAC AAG TTT CCT AGT TCA CCC TTA CGG 24 00 

Pro His lie Pro Arg Ser Pro Tyr Lys Phe Pro Ser Ser Pro Leu Arg 
785 790 795 

ATT CCT GGA GGG AAC ATC TAT ATT TCA CCC CTG AAG AGT CCA TAT AAA 2448 
lie Pro Gly Gly Asn lie Tyr lie Ser Pro Leu Lys Ser Pro Tyr Lys 
800 805 810 

ATT TCA GAA GGT CTG CCA ACA CCA ACA AAA ATG ACT CCA AGA TCA AGA 2496 
lie Ser Glu Gly Leu Pro Thr Pro Thr Lys Met Thr Pro Arg Ser Arg 
815 820 825 830 

ATC TTA GTA TCA ATT GGT GAA TCA TTC GGG ACT TCT GAG AAG TTC CAG 2544 
lie Leu Val Ser lie Gly Glu Ser Phe Gly Thr Ser Glu Lys Phe Gin 
835 840 845 

AAA ATA AAT CAG ATG GTA TGT AAC AGC GAC CGT GTG CTC AAA AGA AGT 2592 
Lys lie Asn Gin Met Val Cys Asn Ser Asp Arg Val Leu Lys Arg Ser 
850 855 860 

GCT GAA GGA AGC AAC CCT CCT AAA CCA CTG AAA AAA CTA CGC TTT GAT 2640 
Ala Glu Gly Ser Asn Pro Pro Lys Pro Leu Lys Lys Leu Arg Phe Asp 
865 870 875 



ATT GAA GGA TCA GAT GAA GCA GAT GGA AGT AAA CAT CTC CCA GGA GAG 
He Glu Gly Ser Asp Glu Ala Asp Gly Ser Lys His Leu Pro Gly Glu 
880 885 890 



2688 



TCC AAA TTT CAG CAG AAA CTG GCA GAA ATG ACT TCT ACT CGA ACA CGA 2 736 

Ser Lys Phe Gin Gin Lys Leu Ala Glu Met Thr Ser Thr Arg Thr Arg 
895 900 905 910 

ATG CAA AAG CAG AAA ATG AAT GAT AGC ATG GAT ACC TCA AAC AAG GAA 2784 
Met Gin Lys Gin Lys Met Asn Asp Ser Met Asp Thr Ser Asn Lys Glu 
915 920 925 

GAG AAA TGAGGATCTC AGGACCTTGG TGGACACTGT GTACACCTCT GGATTCATTG 284 0 

Glu Lys 



TCTCTCACAG ATGTGACTGT ATAACTTTCC CAGGTTCTGT TTATGGCCAC ATTTAATATC 290 0 

TTCAGCTCTT TTTGTGGATA TAAAATGTGC AGATGCAATT GTTTGGGTGA TTCCTAAGCC 296 0 

ACTTGAAATG TTAGTCATTG TTATTTATAC AAGATTGAAA ATCTTGTGTA AATCCTGCCA 302 0 

TTTAAAAAGT TGTAGCAGAT TGTTTCCTCT TCCAAAGTAA AATTGCTGTG CTTTATGGAT 3080 

AGTAAGAATG GCCCTAGAGT GGGAGTCCTG ATAACCCAGG CCTGTCTGAC TACTTTGCCT 314 0 

TCTTTTGTAG CATATAGGTG ATGTTTGCTC TTGTTTTTAT TAATTTATAT GTATATTTTT 32 00 

TTAATTTAAC ATGAACACCC TTAGAAAATG TGTCCTATCT ATCTTCCAAA TGCAATTTGA 3260 

TTGACTGCCC ATTCAC CAAA ATTATCCTGA ACTCTTCTGC AAAAATGGAT ATTATTAGAA 332 0 

ATTAGAAAAA AATTACTAAT TTTACACATT AGATTTTATT TTACTATTGG AATCTGATAT 33 80 

ACTGTGTGCT TGTTTTATAA AATTTTGCTT TTAATTAAAT AAAAGCTGGA AGCAAAGTAT 3440 

AAC CAT ATGA TACTATCATA CTACTGAAAC AGATTTCATA CCTCAGAATG TAAAAGAACT 3500 

TACTGATTAT TTTCTTCATC CAACTTATGT TTTTAAATGA GGATTATTGA TAGT 3554 



(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 928 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:51: 



Met Pro Pro Lys Thr Pro Arg Lys Thr Ala Ala Thr Ala Ala Ala Ala 
15 10 15 



Ala Ala Glu Pro Pro Ala Pro Pro Pro Pro Pro Pro Pro Glu Glu Asp 
20 25 30 

Pro Glu Gin Asp Ser Gly Pro Glu Asp Leu Pro Leu Val Arg Leu Glu 
35 40 45 

Phe Glu Glu Thr Glu Glu Pro Asp Phe Thr Ala Leu Cys Gin Lys Leu 
50 55 60 

Lys lie Pro Asp His Val Arg Glu Arg Ala Trp Leu Thr Trp Glu Lys 
65 70 75 80 

Val Ser Ser Val Asp Gly Val Leu Gly Gly Tyr lie Gin Lys Lys Lys 
85 90 95 

Glu Leu Trp Gly lie Cys lie Phe lie Ala Ala Val Asp Leu Gly Asp 
100 105 110 

Met Ser Phe Thr Phe Thr Glu Leu Gin Lys Asn lie Glu lie Ser Val 
115 120 125 

His Lys Phe Phe Asn Leu Leu Lys Glu lie Asp Thr Ser Thr Lys Val 
130 135 140 

Asp Asn Ala Met Ser Arg Leu Leu Lys Lys Tyr Asp Val Leu Phe Ala 
145 150 155 160 

Leu Phe Ser Lys Leu Glu Arg Thr Cys Glu Leu lie Tyr Leu Thr Gin 
165 170 175 

Pro Ser Ser Ser lie Ser Thr Glu lie Asn Ser Ala Leu Val Leu Lys 
180 185 190 

Val Ser Trp lie Thr Phe Leu Leu Ala Lys Gly Glu Val Leu Gin Met 
195 200 205 



Glu Asp Asp Leu 
210 

Tyr Phe lie Lys 
225 

Thr Ala Val He 



Gin Asn Arg Ser 
260 

He He Glu Val 
275 



Val He Ser Phe 
215 

Leu Ser Pro Pro 
230 

Pro He Asn Gly 
245 

Ala Arg He Ala 



Leu Cys Lys Glu 
280 



Gin Leu Met Leu 
220 

Met Leu Leu Lys 
235 

Ser Pro Arg Thr 
250 

Lys Gin Leu Glu 
265 

His Glu Cys Asn 



Cys Val Leu Asp 



Glu Pro Tyr Lys 
240 

Pro Arg Arg Gly 
255 

Asn Asp Thr Arg 
270 

He Asp Glu Val 
285 



Lys Asn Val Tyr Phe Lys Asn Phe He Pro Phe Met Asn Ser Leu Gly 
290 295 300 



Leu Val Thr Ser Asn Gly Leu Pro Glu Val Glu Asn Leu Ser Lys Arg 
305 310 315 320 



Tyr Glu Glu lie Tyr Leu Lys Asn Lys Asp Leu Asp Ala Arg Leu Phe 
325 330 335 

Leu Asp His Asp Lys Thr Leu Gin Thr Asp Ser lie Asp Ser Phe Glu 
340 345 350 

Thr Gin Arg Thr Pro Arg Lys Ser Asn Leu Asp Glu Glu Val Asn Val 
355 360 365 

lie Pro Pro His Thr Pro Val Arg Thr Val Met Asn Thr lie Gin Gin 
370 375 380 

Leu Met Met lie Leu Asn Ser Ala Ser Asp Gin Pro Ser Glu Asn Leu 
385 390 395 400 

lie Ser Tyr Phe Asn Asn Cys Thr Val Asn Pro Lys Glu Ser lie Leu 
405 410 415 

Lys Arg Val Lys Asp lie Gly Tyr lie Phe Lys Glu Lys Phe Ala Lys 
420 425 430 

Ala Val Gly Gin Gly Cys Val Glu lie Gly Ser Gin Arg Tyr Lys Leu 
435 440 445 

Gly Val Arg Leu Tyr Tyr Arg Val Met Glu Ser Met Leu Lys Ser Glu 
450 455 460 

Glu Glu Arg Leu Ser lie Gin Asn Phe Ser Lys Leu Leu Asn Asp Asn 
465 470 475 480 

lie Phe His Met Ser Leu Leu Ala Cys Ala Leu Glu Val Val Met Ala 
485 490 495 

Thr Tyr Ser Arg Ser Thr Ser Gin Asn Leu Asp Ser Gly Thr Asp Leu 
500 505 510 

Ser Phe Pro Trp lie Leu Asn Val Leu Asn Leu Lys Ala Phe Asp Phe 
515 520 525 

Tyr Lys Val He Glu Ser Phe He Lys Ala Glu Gly Asn Leu Thr Arg 
530 535 540 

Glu Met He Lys His Leu Glu Arg Cys Glu His Arg He Met Glu Ser 
545 550 555 560 

Leu Ala Trp Leu Ser Asp Ser Pro Leu Phe Asp Leu He Lys Gin Ser 
565 570 575 



Lys Asp Arg Glu Gly Pro Thr Asp His Leu Glu Ser Ala Cys Pro Leu 
580 585 590 



Asn Leu Pro Leu Gin Asn Asn His Thr Ala Ala Asp Met Tyr Leu Ser 
595 600 605 



Pro Val Arg Ser Pro 
610 

Thr Ala Asn Ala Glu 
625 

Pro Leu Lys Ser Thr 
645 

Leu Ala Tyr Leu Arg 
660 

His Pro Glu Leu Glu 
675 

Gin Asn Glu Tyr Glu 
690 

Met Cys Ser Met Tyr 
705 

Phe Lys lie He Val 
725 

Glu Thr Phe Lys Arg 
740 

He Val Phe Tyr Asn 
755 

Leu Gin Tyr Ala Ser 
770 

He Pro Arg Ser Pro 
785 

Gly Gly Asn He Tyr 
805 

Glu Gly Leu Pro Thr 
820 

Val Ser He Gly Glu 
835 

Asn Gin Met Val Cys 
850 



Lys Lys Lys Gly Ser Thr 
615 

Thr Gin Ala Thr Ser Ala 
630 635 

Ser Leu Ser Leu Phe Tyr 
650 

Leu Asn Thr Leu Cys Glu 
665 

His He He Trp Thr Leu 
680 

Leu Met Arg Asp Arg His 
695 

Gly He Cys Lys Val Lys 
710 715 

Thr Ala Tyr Lys Asp Leu 
730 

Val Leu He Lys Glu Glu 
745 

Ser Val Phe Met Gin Arg 
760 

Thr Arg Pro Pro Thr Leu 
775 

Tyr Lys Phe Pro Ser Ser 
790 795 

He Ser Pro Leu Lys Ser 
810 

Pro Thr Lys Met Thr Pro 
825 

Ser Phe Gly Thr Ser Glu 
840 

Asn Ser Asp Arg Val Leu 
855 



Thr Arg Val Asn Ser 
620 

Phe Gin Thr Gin Lys 
640 

Lys Lys Val Tyr Arg 
655 

Arg Leu Leu Ser Glu 
670 

Phe Gin His Thr Leu 
685 

Leu Asp Gin He Met 
700 

Asn He Asp Leu Lys 
720 

Pro His Ala Val Gin 
735 

Glu Tyr Asp Ser He 
750 

Leu Lys Thr Asn He 
765 

Ser Pro He Pro His 
780 

Pro Leu Arg He Pro 
800 

Pro Tyr Lys He Ser 
815 

Arg Ser Arg He Leu 
830 

Lys Phe Gin Lys He 
845 

Lys Arg Ser Ala Glu 
860 



Gly Ser Asn Pro Pro Lys Pro Leu Lys Lys Leu Arg Phe Asp He Glu 
865 870 875 880 



Gly Ser Asp Glu Ala Asp Gly Ser Lys His Leu Pro Gly Glu Ser Lys 
885 890 895 



Phe Gin Gin Lys Leu Ala Glu Met 
900 

Lys Gin Lys Met Asn Asp Ser Met 
915 920 



Thr Ser Thr Arg Thr Arg Met Gin 
905 910 

Asp Thr Ser Asn Lys Glu Glu Lys 
925 
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